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TALAROMYCES EMERSONII BETA-GLUCANASES 

NOVEL GLUCANASES 
Field of Invention 

The present invention relates to novel (B-)glucanases (and so also to cellulases) from the 
fungus Talaromyces, in particular Talaromyces emersonii, and their use in degrading glucan in 
cellulose. 

Background of the invention 

The composition of a plant cell wall is complex and variable. Polysaccharides are mainly 
found in the form of long chains of cellulose (the main structural component of the plant cell 
wall), hemicellulose (comprising various B-xylan chains, such as xyloglucans) and pectin. 

Beta-glucan (or, more properly, (l~>3)(l-4)fi-D-glucan), MW 10 to 14 kDa, is a component 
of plant hemicellulose (MW of about 700 kDa) and consists of a backbone of B-1,4 and 1,3- 
linked glucopyranose residues. Another component is xylan which consists of fl-l,4-linked 
xylose residues, optionally substituted with side chains such as arabinose and/or glucuronic acid 
residues. 

Basic differences exist between monocotyledons (e.g. cereals and grasses) and dicotyledons 
(e.g. clover, rapeseed and soybean) and between the seed and vegetative parts of the plant. 
Monocotyledons are characterized by the presence of an arabinoxylan complex as the major 
hemicellulose backbone, and the main structure of hemicellulose in dicotyledons is a xyloglucan 
complex. Higher pectin concentrations are found in dicotyledons than in monocotyledons. Seeds 
are generally high in pectic substances but relatively low in cellulosic material. 

Cellulose degrading enzymes are used for the processing, of plant material in food as well as 
feed applications or as a food or feed additive due to of their capability to act on main plant cell 
wall substituents. 

Most of the cellulose degrading enzymes available to the industry appear to be glucanases 
with a relatively low molecular weight and a moderate stability at higher temperatures. 
However, for certain applications it is desirable to use a glucanase with a relatively high 
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thermostability. If a glucanase is to be used as an animal feed additive then a high thermostability 
is preferred because of the high temperature conditions applied during pelleting the animal feed. 

Summary of the Invention 

Novel B-glucanases (or cellulases) are now provided which are able to cleave B-D-glucan (or 
cellulose) such as present in plant material. 

At its broadest, the invention relates to B-glucanases (or cellulases) from Talaromyces, such 
as Talaromyces emersonii (a fungus). These B-glucanases can cleave B-D-glucan or cellulose, 
for example they are fl-l,4-endoglucanases, or have the activity EC 3.2. L4. 

The glucanases may have: 

a. an optimum pH below 7.0 or 5.4, such as below 5.0, for example from 4.4 to 5.2 or 
from 3.0 to 6.0 or 7.0; or 

b. a temperature optimum of at least 72°C, 75°C or even 8 1°C, such as from 78°C to 85°C 
or from 83°C to 87°C. 

More specifically, the present invention provides an (isolated) B-glucanase polypeptide 
comprising: 

(i) the amino acid sequence of SEQ ID No: 2,4 or 6; or t . " "; 

(ii) a variant of (i) which is capable of cleaving B-D-glucan; or 

(iii) a fragment of (i) or (ii) which is capable of cleaving B-D-glucan. 

According to another aspect of the invention there is provided a polynucleotide which 
comprises: 

(a) the nucleic acid sequence of SEQ ID No. 1, 3 or 5, or a sequence encoding a 
polypeptide of the invention; 

(b) a sequence which is complementary to, or which hybridises to, any sequence as 
defined in (a); 

(c) a fragment of any sequence in (a) or (b); 

(d) a sequence having at least 60% identity to any sequence as defined in (a), (b) or (c); or 

(e) a sequence that is degenerate as a result of the genetic code to any of the sequences as 
defined in (a) to (d). 

The invention also provides: 

an (eg. expression) vector which comprises a polynucleotide of the invention and 
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which may be capable of expressing a polypeptide of the invention; 
a cell line or strain comprising a vector of the invention; 

a method of producing a polypeptide of the invention which method comprises :~ 
maintaining a cell line or strain of the invention under conditions suitable for 
obtaining expression of the polypeptide and, if necessary, isolating the 
polypeptide; 

a method of degrading B-D-glucan, the method comprising contacting a material 
comprising B-D-glucan with a polypeptide of the invention; 
a method for identification of a compound that modulates B-glucanase activity, 
which method comprises contacting a polypeptide of the invention with a test 
compound in the presence of fi-D-glucan and monitoring for or detecting any 
modulation of activity; 

a method of treating a subject having hyperlipemia which method comprises 
administering to the subject an effective amount of the polypeptide of the 
invention; and 

use of the polypeptide in the manufacture of a medicament for the treatment or, 
prophylaxis of hyperlipemia. 

Brief Description of the Sequences 

SEQ ID No. 1 is a DNA sequence of a first B-glucanase (CEA) of the invention from 
Talaromyces emersonii; 

SEQ ID No. 2 is the amino acid sequence of the first B-glucanase, CEA; 
SEQ ID No. 3 is a DNA sequence of a second B-glucanase (CEB) from the same organism; 
SEQ ID No. 4 is the amino acid sequence of the second B-glucanase, CEB; 
SEQ ID No. 5 is a DNA sequence of a third fi-glucanase (CEC) also from the same organism; 
SEQ ID No. 6 is the amino acid sequence of the third B-glucanase, CEC; and 
SEQ ID Nos. 7 and 8 are PCR primers that hybridize to SEQ ID No. 1 (and were used to redone 
cDNA inserts during genetic analysis of positive transformants). 

Detailed Description of the Invention 
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A. Polynucleotides . 

The present invention provides a (e.g. isolated and/or purified) polynucleotide encoding 
polypeptides of the invention. The present invention thus provides a polynucleotide encoding a 
fi-glucanase whose amino acid sequence is set out in SEQ ID No. 2, 4 or 6. The present 
invention further provides a polynucleotide encoding a polypeptide having substantial amino 
acid sequence homology to the amino acid sequence set out in SEQ ID No. 2, 4 or 6. Also 
included is a polynucleotide selected from: 

(a) a polynucleotide comprising the nucleotide sequence set out in SEQ ID No. 1, 3 or 5, 
or the complement thereof; 

(b) a polynucleotide comprising a nucleotide sequence capable of (e.g. selectively) 
hybridising to a nucleotide sequence set out in SEQ ID No. 1, 3 or 5, or a fragment 
thereof; 

(c) a polynucleotide comprising a nucleotide sequence capable of (e.g. selectively) 
hybridising to the complement of the nucleotide sequence set out in SEQ ID No. 1, 3 
or 5 ? or a fragment thereof; and/or 

(d) a polynucleotide comprising a polynucleotide sequence that is degenerate as a result of 
the genetic code to a polynucleotide defined in (a), (b) or (c). 

A polynucleotide of the invention also includes a polynucleotide which: 

(a) encodes a polypeptide having fl-glucanase activity, which polynucleotide is: 

(1) the coding sequence of SEQ ID No. 1, 3 or 5; 

(2) a sequence which hybridises selectively to the complement of sequence defined in 
(l);or 

(3) a sequence that is degenerate as a result of the genetic code with respect to a 
sequence defined in (1) or (2); or 

.(b) is a sequence complementary to a polynucleotide defined in (a). 

Hybridisable sequences 

The term "capable of hybridizing" means that the target polynucleotide of the invention can 
hybridize to the nucleic acid used as a probe (for example the nucleotide sequence set out in 
SEQ. ID No.l, 3 or 5 , or a fragment thereof-or the complement thereof) at a level significantly 
above background. The invention also includes nucleotide sequences that encode for fi-glucanase 
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or variants thereof as well as nucleotide sequences which are complementary thereto. The 
nucleotide sequence may be RNA or DNA and thus includes genomic DNA, synthetic DNA or . 
cDNA. Preferably the nucleotide sequence is a DNA sequence and most preferably, a cDNA 
sequence. Typically a polynucleotide of the invention comprises a contiguous sequence of 
nucleotides which is capable of hybridizing under selective conditions to the coding sequence or 
the complement of the coding sequence of SEQ ID No. 1, 3 or 5 , as appropriate. Such 
nucleotides can be synthesized according to methods well known in the art 1 . 

A polynucleotide of the invention can hybridize to the coding sequence or the complement 
of the coding sequence of SEQ ID No. 1, 3 or 5 (as appropriate) at a level significantly above 
background. Background hybridization may occur, for example, because of other cDNAs 
present in a cDNA library. The signal level (eg. generated by the interaction between a 
polynucleotide of the invention and the coding sequence or complement of the coding sequence 
of SEQ ID No. 1, 3 or 5) is typically at least 10 fold, preferably at least 100 fold, as intense as 
interactions between other polynucleotides and the coding sequence of SEQ ID No. 1, 3 or 5. 
The intensity of interaction may be measured, for example, by radiolabelling the probe, e.g. with 
32 P. Selective hybridization may typically be achieved using conditions of low stringency (0.3M 
sodium chloride and 0.03M sodium citrate at about 40°C), medium stringency (for example, 
0.3M sodium chloride and 0.03M sodium citrate at about 50°C) or high stringency (for example, 
0.3M sodium chloride and 0.03M sodium citrate at about 60°C). Hybridization may be carried 
out under any suitable conditions known in the art 1 arid, as a guide, low stringency can be 
2 x SSC at 55°C, medium stringency can be 0.5 to 1 .0 x SSC at 60°C and high stringency can be 
0.1 or 0.2 x SSC at 60°C or higher (e.g. at 68°C), all at 0.5% SDS. 

Modifications 

Polynucleotides of the invention may comprise DNA or RNA. They may be single or 
double stranded. They may also be polynucleotides which include within them one or more 
synthetic or modified nucleotides. A number of different types of modifications to 
polynucleotides are known in the art. These include methylphosphonate and phosphorothioate 
backbones and/or addition of acridine or poly lysine chains at the 3 ' and/or 5 5 ends of the 
molecule. For the purposes of the present invention, it is to be understood that the 
polynucleotides described herein may be modified by any method available in the art. 
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It is to be understood that skilled persons may, using routine techniques, make nucleotide 
substitutions that do not affect the polypeptide sequence encoded by the polynucleotides of the 
invention to reflect the codon usage of any particular host organism, for example in which the 
polypeptides of the invention are to be expressed. , * 1 

The coding sequence of SEQ ID No. 1, 3 or 5 may be modified by nucleotide substitutions, 
for example from or up to 1, 2 or 3 to 10, 25, 50 or 100 substitutions. The polynucleotide may 
alternatively or additionally be modified by one or more insertions and/or deletions and/or by an 
extension at either or both ends. The modified polynucleotide generally encodes a polypeptide 
which has B-glucanase activity. Degenerate substitutions may be made and/or substitutions may 
be made which would result in a conservative amino acid substitution when the modified 
sequence is translated, for example as discussed with reference to polypeptides later. 

! 

Homologues 1 

A nucleotide sequence which is capable of selectively hybridizing to (e.g. the complement 
of) the DNA coding sequence of SEQ ID No. 1, 3 or 5 may have at least 50% or 60%, at least 
70%, at least 80%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity (or 
homology) to the coding sequence of SEQ ID No. 1, 3 or 5. This may be over a region of at least 
20, preferably at least 30, for instance at least 40, at least 60, more preferably at least 100 
contiguous nucleotides or optimally over the full length of SEQ ID No. 1, 3 or 5. For SEQ ID 
No. 1, the sequence identity is preferably at least 75% or 80%, for SEQ ID No. 3 preferably at 
least 85%, and for SEQ ID No. 5 preferably at least 85%. 

Any combination of the above mentioned degrees of homology and minimum sized may be 
used to define polynucleotides of the invention, with the more stringent combinations (i.e. higher 
homology over longer lengths) being preferred. Thus for example a polynucleotide which is at 
least 80% or 90% homologous over 25, preferably over 30 nucleotides forms one aspect of the 
invention, as does a polynucleotide which is at least 90% homologous over 40 nucleotides. 

Homologues of polynucleotide (or protein) sequences typically have at least 70% homology, 
preferably at least 80, 90%, 95%, 97% or 99% homology, for example over a region of at least 
15, 20, 30, 100 more contiguous nucleotides (or amino acids). The homology may calculated on 
the basis of amino acid identity (sometimes referred to as "hard homology"). 

For example the UWGCG Package provides the BESTFIT program which can be used to 
calculate homology (for example used on its default settings 5 ). The PILEUP and BLAST - 
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algorithms can be used to calculate homology or line up sequences (such as identifying 
equivalent or corresponding sequences, for example on their default settings 6 ' 7 ). / ; 

Software for performing BLAST analyses is publicly available through the National Center 
for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first 
identifying high scoring sequence pair (HSPs) by identifying short words of length W in the 
query sequence that either match or satisfy some positive-valued threshold score T when aligned 
with a word of the same length in a database sequence. T is referred to as the neighbourhood 
word score threshold 6 * 7 . These initial neighbourhood word hits act as seeds for initiating searches 
to find HSPs containing them. The word hits are extended in both directions along each 
sequence for as far as the cumulative alignment score can be increased. Extensions for the word 
hits in each direction are halted when: the cumulative alignment score falls off by the quantity X- 
from its maximum achieved value; the cumulative score goes to zero or below, due to the 
accumulation of one or more negative-scoring residue alignments; or the end of either sequence 
is reached. The BLAST algorithm parameters W, T and X determine the sensitivity and speed of 
the alignment. The BLAST program uses as defaults a word length (W) of 1 1, the BLOSUM62 
scoring matrix 8 alignments (B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of both 
strands. 

The BLAST algorithm performs a statistical analysis of the similarity between two 
sequences 9 . One measure of similarity provided by the BLAST algorithm is the smallest sum 
probability (P(N)X which provides an indication of the probability by which a match between 
two nucleotide or amino acid sequences would occur by chance. For example, a sequence is 
considered similar to another sequence if the smallest sum probability in comparison of the first 
sequence to the second sequence is less than about 1, preferably less than about 0.1, more 
preferably less than about 0.01, and most preferably less than about 0.001 . 

Primers and Probes 

Polynucleotides of the invention include and may be used as a primer, e.g. a PCR primer, a 
primer for an alternative amplification reaction, a probe, or the polynucleotides may be. cloned 
into vectors. Such primers, probes and other fragments will be at least or up to 20, for example 
at least 25, 30 or 40 nucleotides in length. They will typically be up to 40, 50, 60, 70, 100, 150, 
200 or 300 nucleotides in length, or even up to a few nucleotides (such as 5 or 10 nucleotides) 
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short of the coding sequence of SEQ ID No. 1, 3 or 5. 

In general, primers will be produced by synthetic means, involving a step-wise manufacture 
of the desired nucleic acid sequence one nucleotide at a time. Techniques for accomplishing this 
using automated techniques are readily available in the art. Examples of primers of the invention 
are set out in SEQ ID Nos 7 and 8. 

Longer polynucleotides will generally be produced using recombinant means, for example 
using PCR (polymerase chain reaction) cloning techniques. This will involve making a pair of 
primers (e.g. of about 15-30 nucleotides) to a region of the B-glucanase which it is desired to 
clone, bringing the primers into contact with mRNA or cDNA obtained from a target (e.g yeast, 
bacterial, plant, prokaryotic or fungal) cell, preferably of an Talaromyces strain, performing a 
polymerase chain reaction under conditions which bring about amplification of the desired 
region, isolating the amplified fragment (e.g. by purifying the reaction mixture on an agarose gel) 
and recovering the amplified DNA. The primers may be designed to contain suitable restriction 
enzyme recognition sites so that the amplified DNA can be cloned into a suitable cloning vector. • 

Such techniques may be used to obtain all or part of the fl-glucanase sequence described 
herein. Genomic clones corresponding to the cDNA of SEQ ID No. 1, 3 or 5 or the B-glucanase 
gene containing, for example, introns and promoter regions are within the invention also and 
may also be obtained in an analogous manner (e.g. recombinant means, PCR, cloning 
techniques), starting with genomic DNA from a fungal, yeast, bacterial plant or prokaryotic cell. 

The polynucleotides or primers may carry a revealing label, e.g. a radioactive or 
non-radioactive label. Suitable labels include radioisotopes such as 32 P or 35 S, enzyme labels, or 
other protein labels such as biotin. Such labels may be added to polynucleotides or primers of 
the invention and may be detected using techniques known per se. 

Polynucleotides, labelled or unlabelled may be used in nucleic acid-based tests for detecting 
or sequencing B-glucanase or a variant thereof in a (e.g. fungal) sample. Such tests for detecting 
generally comprise bringing a (e.g. fungal) sample (suspected of) containing DNA into contact 
with a probe or primer of the invention under hybridizing conditions and detecting any duplex 
formed between the probe and nucleic acid in the sample. Such detection may be achieved using 
techniques such as PCR or by immobilizing the probe on a solid support, removing nucleic acid 
in the sample which is not hybridized to the probe, and then detecting nucleic acid which was 
hybridized to the probe. Alternatively, the sample nucleic acid may be immobilized on a solid 
support, and the amount of probe bound to such a support can be detected. 



.0170998A1J_> 



WO 01/70998 



9 



PCT/EPO 1/03 174 



The probes of the invention may conveniently be packaged in the form of a test kit in a 
suitable container. In such kits the probe may be bound to a solid support where the assay format 
for which the kit is designed requires such binding. The kit may also contain suitable reagents 
for treating the sample to be probed, hybridizing the probe to nucleic acid in the sample, control 
reagents, instructions, and the like. 

Preferably, the polynucleotide of the invention is obtainable from the same organism as the 
polypeptide, such as a fungus, in particular a fungus of the genus Talaromyces. 

The polynucleotides of the invention also include variants of the sequence of SEQ ID No. 1, 
3 or 5 which have B-glucanase activity. Variants may be formed by additions, substitutions 
and/or deletions and may have the ability to cleave a fi-D-glucan polymer. 

Production of polynucleotides 

Polynucleotides which do not have 100% identity with SEQ ID No. 1, 3 or 5 but fall within 
the scope of the invention can be obtained in a number of ways. Thus variants of the B-glucanase 
sequence described herein may be obtained for example by probing genomic DNA libraries 
made from a range of organisms, for example those discussed as sources of the polypeptides of 
the invention. In addition, other fungal, plant or prokaryotic homologues of 6-glucanase may be 
obtained and such homologues and fragments thereof in general will be capable of hybridising to 
SEQ ID No. 1, 3 or 5. Such sequences may be obtained by probing cDNA libraries or genomic 
DNA libraries from other species, and probing such libraries with probes comprising all or part 
of SEQ ID. 1, 3 or 5 under conditions of medium to high stringency (as described earlier). 
Nucleic acid probes comprising all or part of SEQ ID No. 1, 3 or 5 may be used to probe cDNA 
libraries from other species, such as those described as sources for the polypeptides of the 
invention. 

Species homologues may also be obtained using degenerate PCR which will use primers 
designed to target sequences within the variants and homologues encoding conserved amino acid ' 
sequences. The primers can contain one or more degenerate positions and will be used at 
stringency conditions lower than those used for cloning sequences with single sequence primers 
against known sequences. 

Alternatively, such polynucleotides may be obtained by site directed mutagenesis of the 
6-glucanase sequences or variants thereof. This may be useful where for example silent codon 
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changes are required to sequences to optimise codon preferences for a particular host cell in 
which the polynucleotide sequences are being expressed. Other sequence changes may be 
desired in order to introduce restriction enzyme recognition sites, or to alter the property or 
function of the polypeptides encoded by the polynucleotides. 

The invention includes double stranded polynucleotides comprising a polynucleotide of the 
invention and its complement. 

The present invention also provides polynucleotides encoding the polypeptides of the 
invention described below. Since such polynucleotides will be useful as sequences for 
recombinant production of polypeptides of the invention, it is not necessary for them to be 
capable of hybridising to the sequence of SEQ ID No. 1, 3 or 5, although this will generally be 
desirable. Otherwise, such polynucleotides may be labelled, used, and made as described above 
if desired. 

B. Polypeptides . 

The present invention relates to (e.g. (substantially) purified and/or isolated) B-glucanases 
(or cellulases) and variants thereof. The polypeptides of the invention may consist essentially of 
the amino acid sequence of SEQ ID No. 2, 4 or 6 or of a variant of that sequence. Polypeptides 
may also be encoded by a polynucleotide of the invention as described above. 

A polypeptide of the invention may be in an isolated or a substantially purified form. It will 
be understood that the polypeptide may be mixed with carriers or diluents which will not 
interfere with the intended purpose and/of function of the polypeptide and still be regarded as 
substantially isolated. It will generally comprise the polypeptide in a preparation in which more 
than 20%, e.g. more than 30%, 40%, 50%, 80%, 90%, 95% or 99%, by weight of the polypeptide 
in the preparation is a polypeptide of the invention. Routine methods can be employed to purify 
and/or synthesise the proteins according to the invention 1 . For some formulations (e.g". for 
non-pharmaceutical uses) the amount of polypeptide present may be small, for example from 
0.01 to 10%>, such as from 0. 1 to 5%, or 2% or even from 0.2 to 1%. 

Preferably, the polypeptide of the invention is obtainable from a microorganism which 
possesses a gene encoding an enzyme with fl-glucanase (or cellulase) activity. More preferably 
the microorganism is fungal, and optimally a filamentous fungi. Preferred organisms are thus of 
the genus Talaromyces, such as of the species Talaromyces emersonii. 
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Activity 

A polypeptide of the invention can have one or more of the following features, namely it: 

(1) possesses U-glucanase (or cellulase) activity; 

(2) has an optimum pH range of from 3 to 6.5 or 7.0, such as from 4 to 5.5 or 6.0, 
optimally from 4.5 to 5.0; 

(3) has optimum activity at a temperature of from 30°C to 100°C, such as 60 to 95°C, 
optimally from 75°C or 80°C to 90°C. Preferably the optimum temperature is at least 
75°C, such as at least 85°C; 

(4) has a molecular weight (deglycosylated) of from 20 to 60 kDa, preferably from 20 to 
25, 35 to 45 or 23 to 50 kDa, optimally from 36 to 40 kDa or (glycosylated) or from 40 
to 45kDa; 

(5) has an isoelectric point of from 3.0 to 3.6 or 5.0, 3.8 to 4.5, 4.5 to 5.0 or from 6.0 to 
7.0; and/or 

(6) possesses hydrolytic activity on cereal J3-glucan or exhibits hydrolytic activity below 
pH 7. 

The polypeptide can have the activity of EC.3.2.1.4 (or endoglucanase activity). In general 
this can be endohydrolysis of 1,4-B-D-glucosidic linkages in cellulose. "B-glucanase activity" is 
the ability to cleave cellulose or a B-glucan polymer (for example as found in plants e.g. oat or 
barley). The activity thus allows cleavage between adjacent glucopyranose terminal and/or non- 
terminal units. Preferably the cleavage occurs at a [glucose (1-4), (1-3) or (1-6) glucose] linkage. 
The polypeptide may preferentially cleave in between two adjacent (e.g. non-substituted 
glucose) units. It can thus have endo activity (i.e. be an endoglucanase). The substrate polymer 
may or may not be substituted. Preferably the polypeptide will not have xylanase activity. 

The polypeptide may also have the cellulase activity, that is to say it is active on, or can 
cleave, cellulose. As B-glucan is a component of cellulose, all glucanases fall within the broader 
term of cellulases. The polypeptides of the present invention are therefore cellulases because 
they belong in the sub-class of glucanases. Other types of classes of activity within cellulases, 
apart from endoglucanase (EC 3.2.1.4, as mentioned above) are exo-glucanase/cellobiohydrolase 
(EC 3.2.1.91), B-glucosidase (EC 3.2.1.21), endo-l,6-glucanase (EC 3.2.1.75), exo-l,3-glucanase 
(EC 3.2.1.58), mannanase (EC 3.2.1.78) and endo-glycoceramidase (EC 3.2.1.123).. Some of 
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the polypeptides are thought to have one, two or more of these additional activities based on their 
structure and sequence, by comparison with known enzymes and the family of hydrolases into . 
which they fall. Thus, in the specification, where the context is appropriate, glucanase activity 
can mean cellulase activity. A list of (cellulase) activities for the polypeptides is provided in 
Example 1 1 later. 

Preferably the polypeptide falls into one of the families 5, 7 or 45, according to the 
glycoside; hydrolase (CAZy) classification. The polypeptide may be a glu/glu or asp/asp 
nucleophile/proton donor. 

Variants and Homologues 

A polypeptide of the invention can comprise the amino acid sequence set out in SEQ ID No. 
2, 4 or 6 or a substantially homologous sequence, or a fragment of either sequence and can have 
B-glucanase activity. In general, the naturally occurring amino acid sequence shown in SEQ ID 
No. 2, 4 or 6 is preferred. 

In particular, the polypeptide of the invention may comprise: 

a. the polypeptide sequence of SEQ ID No. 2, 4 or 6; 
- b. a naturally occurring variant or species homologue thereof; or 

c. a protein with at least 70, at least 80, at least 90, at least 95, at least 98 or at least 99% 
sequence identity to (a) or (b). 

A variant may be one that occurs naturally, for example in fungal, bacteria, yeast or plant 
cells and which can function in a substantially similar manner to the protein of SEQ ID No. 2, 4 
or 6, for example it has B-glucanase activity. Similarly a species homologue of the protein will 
be the equivalent protein which occurs naturally in another species and which can function as an 
B-glucanase enzyme. Variants include allelic variants either from the same strain as the 
polypeptide of the invention or from a different strain, but of the same genus, or of the same 
species. 

Variants and species homologues can be obtained by following the procedures described 
herein for the production of the polypeptide of SEQ ID No. 2, 4 or 6 and performing such 
procedures on a suitable cell source, for example a bacterial, yeast, fungal or plant cell. It will 
also be possible to use a probe as defined above to probe libraries made from yeast, bacterial, 
fungal or plant cells in order to obtain clones including the variants or species homology. The 
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clones can be manipulated by conventional techniques to generate a polypeptide of the invention 
which can then be produced by recombinant or synthetic techniques known per se. 

The polypeptide of the invention preferably has at least 70% sequence identity to the protein 
of SEQ ID No. 2, 4 or 6, more preferably at least 80%, at least 90%, at least 95%, at east 97% or 
at least 99% sequence identity thereto, for example over a region of at least 60, at least 100, 150, 
200 or 300 contiguous amino acids or over the full length of SEQ ID No. 2, 4 or 6. For SEQ ID : 
No. 2, the sequence identity is preferably at least 70%, for SEQ ID No. 4 at least 60%, and for 
SEQ ID No. 6 at least 65%. , y 

The sequence of the polypeptide of SEQ ID No. 2, 4 or 6 and of variants and species 
homolbgues can thus be modified to provide polypeptides of the invention. Amino acid 
substitutions may be made, for example from or up to 1, 2 or 3 to 10, 20, 30, 50 or 100 
substitutions. The same number of deletions and insertions may also be made. These changes 
may be made outside regions critical to the function of the polypeptide and so may still result in 
an active enzyme. The modified polypeptide generally retains activity as a B-glucanase. 

Polypeptides of the invention include fragments of the above mentioned full length 
polypeptides and of variants thereof, including fragments of the sequence set out in SEQ ID No. 
2, 4 or 6. Such fragments typically retain activity as a B-glucanase. Fragments may be at least 
50, 100, 150, 200 or 250 amino acids long or may be this number of amino acids short of the full 
length sequence (as shown in SEQ ID No. 2, 4 or 6). 

Polypeptides of the invention can if necessary be produced by synthetic means although 
usually they will be made recombinantly as described below. They may be modified for 
example by the addition of histidine residues or a T7 tag to assist their identification or 
purification or by the addition of a signal sequence to promote their secretion from a cell. 

The term "variants" refers to polypeptides which have the same essential character or basic 
biological functionality as the B-glucanase (or cellulase), and include allelic variants. The 
essential character of B-glucanase is that it is an en2yme that exhibits EC 3.2.1.4 activity or that it 
can cleave l-*3, l->4 and/or 1~*6 links in fi-D-glucan, such as cereal or oat spelt (fi)-glucan. 
Preferably a variant polypeptide has the same activity as the B-glucanase. A polypeptide having 
the same essential character as B-glucanase may be identified by using a cellulose or B-glucan 
degradation assay, for example as described later. 

Variants of SEQ ID No.2, 4 or 6 also include sequences which vary from SEQ ID No.2, 4 or 
6 but which are not necessarily derived from the naturally occurring B-glucanase protein. These 
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variants may be described as having a % homology to SEQ ID No, 2, 4 or 6 or having a number 
of substitutions within this sequence. Alternatively a variant may be encoded by a 
polynucleotide which hybridizes to SEQ ID No. 1, 3 or 5. 

The variants can be defined in a similar manner to the variants of SEQ ID No. 1, 3 or 5. 
Thus the variants may comprise variant sequences derived from other strains of Talaromyces. 
Other variants can be identified from other Talaromyces strains by looking for B-D-glucanase 
activity and cloning and sequencing as before. Variants may include the deletion, modification 
or addition of single amino acids or groups of amino acids within the protein sequence, as long 
as the peptide maintains the basic biological functionality of the B-glucanase. 

Conservative substitutions may be made, for example according to the following Table. 
Amino acids in the same block in the second column and preferably in the same line in the third 
column may be substituted for each other. Preferably substitutions do not affect the folding or 
activity of the polypeptide. 



I ALIPHATIC 


Non-polar 


GAP 






ILV 




Polar-uncharged 


CSTM 






NQ 




Polar-charged 


DE 






KR 


AROMATIC 




HF WY 



Shorter polypeptide sequences are within the scope of the invention. For example, a peptide 
of at least 50 amino acids oi; up to 60, 70, 80, 100, 150 or 200 amino acids in length is considered 
to fall within the scope of the invention as long as it demonstrates the basic biological 
functionality of the B-glucanase. In particular, but not exclusively, this aspect of the invention 
encompasses the situation when the protein is a fragment of the complete protein sequence and 
may comprise or represent a B-D-glucan (or mannose) binding region or a fi-D-glucan (or 
cellulose) cleaving region. 

Modifications 

Polypeptides of the invention may be chemically modified, e.g. post-translationally 
modified. For example, they may be glycosylated (one or more times, by the same or different 
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sugars) or comprise modified amino acid residues. They may also be modified by the addition of 
histidine residues (to assist their purification) or by the addition of a signal sequence (to promote 
insertion into the cell membrane). The polypeptide may have one or more (N) amino- or (C) 
carboxyl-terminal extensions, such as an amino-terminal methionine residue, a small linker 
peptide of up to about 20-25 residues, or a (small) extension that facilitates purification, such as a 
poly-histidine or T7 tag, an antigenic epitope or a (e.g. maltose) binding domain 14 (e.g. at the . 
C -terminus). These extensions may or may not be added via a linker. 

A polypeptide of the invention may be labelled with a revealing label. The revealing label 
may be any suitable label which allows the polypeptide to be detected. Suitable labels include 
radioisotopes, e.g. 125 1, 35 S, enzymes, antibodies, polynucleotides and linkers such as biotin. 

The polypeptides may be modified to include non-naturally occurring amino acids or to 
increase the stability of the polypeptide. When the proteins or peptides are produced by 
synthetic means, such amino acids may be introduced during production. The proteins or 
peptides may also be modified following either synthetic or recombinant production. 

The polypeptides of the invention may also be produced using, or comprise, (one or more) 
D-amino acids. 

A number of side chain modifications are known in the art and may be made to the side 
chains of the proteins or peptides of the present invention. Such modifications include, for 
example, modifications of amino acids by reductive alkylation by reaction with an aldehyde 
followed by reduction with NaBH 4 , amidination with methylacetimidate or acylation with acetic 
anhydride. '- v 

The sequences provided by the present invention may also be used as starting materials for 
the construction of "second generation" enzymes. "Second generation" glucanases are 
glucanases, altered by mutagenesis techniques (e.g. site-directed mutagenesis), which have 
properties that differ from those of wild-type glucanases or recombinant glucanases such as those 
produced by the present invention. For example, the temperature or pH optimum, specific 
activity, substrate affinity or thermostability may be altered so as to be better suited for 
application in a defined process. 

Amino acids essential to the activity of the glucanases of the invention, and therefore 
preferably subject to substitution, may be identified according to procedures known in the art, 
such as site-directed mutagenesis or alanine-scanning mutagenesis 10 . In the latter technique 
mutations are introduced at every residue in the molecule, and the resultant mutant molecules are 
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tested for biological activity (e.g. glucanase activity) to identify amino acid residues that are 
critical to the activity of the molecule. Sites of enzyme-substrate interaction can also be 
determined by analysis of crystal structure as determined by such techniques as nuclear magnetic 
resonance, crystallography or photo-affinity labelling 11,12,13 or molecular modelling. 

The use of yeast and fungal host cells is expected to provide for such post-translational 
modifications (e.g. proteolytic processing, myristilation, glycosylation, truncation, and tyrosine, 
serine or threonine phosphorylation) as may be needed to confer optimal biological activity on 
recombinant expression products of the invention. 

Polypeptides of the invention may be provided in a form such that they are outside their 
natural cellular environment. Thus, they may be substantially isolated or purified, as discussed 
above, or in a cell in which they do not occur in nature, e.g. a cell of other fungal species, 
animals, yeast or bacteria. 

C. Recombinant Aspects . 

The invention also provides vectors comprising a polynucleotide of the invention, including 
cloning and expression vectors, and methods of growing, transforming or transfecting such 
vectors in a suitable host cell, for example under conditions in which expression of a polypeptide 
of the invention occurs, Provided also are host cells comprising a polynucleotide or vector of 
the invention wherein the polynucleotide is heterologous to the genome of the host cell. The 
term "heterologous", usually with respect to the host cell, means that the polynucleotide does not 
naturally occur in the genome of the host cell or that the polypeptide is not naturally produced by 
that cell. Preferably, the host cell is a yeast cell, for example a yeast cell of the genus 
Kluyveromyces or Saccharomyces or a fungal cell, for example of the genus Aspergillus. 

Polynucleotides of the invention can be incorporated into a recombinant replicable vector, 
for example a cloning or expression vector. The vector may be used to replicate the nucleic acid 
in a compatible host cell. Thus in a further embodiment, the invention provides a method of 
making polynucleotides of the invention by introducing a polynucleotide of the invention into a 
replicable vector, introducing the vector into a compatible host cell, and growing the host cell 
under conditions which bring about replication of the vector. The vector may be recovered from 
the host cell. Suitable host cells are described below in connection with expression vectors. 
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Vectors 

The polynucleotide of the invention may inserted into an expression cassette. The vector 
into which the expression cassette or polynucleotide of the invention is inserted may be any 
vector which may conveniently be subjected to recombinant DNA procedures, and the choice of 
the vector will often depend on the host cell into which it is to be introduced. Thus, the vector 
may be an autonomously replicating vector, i.e. a vector which exists as an extra-chromosomal 
entity, the replication of which is independent of chromosomal replication, e.g. a plasmid. 
Alternatively, the vector may be one which, when introduced into a host cell, is integrated into 
the host cell genome and replicated together with the chromosome(s) into which it has been 
integrated. 

Preferably, a polynucleotide of the invention in a vector is operably linked to a regulatory 
sequence which is capable of providing for the expression of the coding sequence by the host 
cell, i.e. the vector is an expression vector. The term "operably linked" refers to a juxtaposition 
wherein the components described are in a relationship permitting them to function in their 
intended manner. A regulatory sequence such as a promoter, enhancer or other expression 
regulation signal "operably linked" to a coding sequence is positioned in such a way that 
expression of the coding sequence is achieved under condition compatible with the control 
sequences. 

The vector may be a plasmid, cosmid, virus or phage vector, usually provided with an origin 
of replication, optionally a promoter for the expression of the polynucleotide and optionally an 
enhancer and/or a regulator of the promoter. A terminator sequence may be present, as may be a 
polyadenylation sequence. The vector may contain one or more selectable marker genes, for 
example an ampicillin resistance gene (in the case of a bacterial plasmid) or a neomycin 
resistance gene (for a mammalian vector). Vectors may be used in vitro, for example for the 
production of RNA or used to transfect or transform a host cell. They may comprise two or more 
polynucleotides of the invention, for example for oyerexpression. 

The DNA sequence encoding the polypeptide is preferably introduced into a suitable host as 
part of an expression cassette (or construct) in which the DNA sequence is operably linked to 
expression signals which are capable of directing expression of the DNA sequence in the host 
cells. For transformation of the suitable host with the expression construct transformation 
procedures are available which are well known to the skilled person 3 ' 4 . The expression construct 
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can be used for transformation of the host as part of a vector carrying a selectable marker, or the 
expression construct may be co-transformed as a separate molecule together with the vector 
carrying a selectable 'marker. The vector may comprise one or more selectable marker genes. 

Preferred selectable markers 15,16 include but are not limited to those that complement a 
defect in the host cell or confer resistance to a drug! They include e.g. versatile marker genes 
that can be used for transformation of most filamentous fungi and yeasts such as acetamidase 
genes or cDNAs (the amdS, niaD,facA genes or cDNAs from A. nidulans, Aoryzae, or A.niger), 
or genes providing resistance to antibiotics like G418, hygromycin, bleomycin, kanamycin, 
phleomycin or benomyl resistance (benA). Alternatively, specific selection markers can be used 
such as auxotrophic markers which require corresponding mutant host strains: e.g. URA3 (from 
S.cerevisiae or analogous genes from other yeasts), pyrG or pyrA (from A. nidulans or A.niger), 
argB (from A. nidulans ox A.niger) or trpC. In a preferred embodiment the selection marker is 
deleted from the transformed host cell after introduction of the expression construct so as to 
obtain transformed host cells capable of producing the polypeptide which are free of selection 
marker genes 21 ' 22 . 

Other markers include ATP synthetase, subunit 9 (oliC), orotidine-5' -phosphate- . 
decarboxylase (pvrA), the bacterial G418 resistance gene (this may also be used in yeast, but not 
in fungi), the ampicillin resistance gene (E. coli\ the neomycin resistance gene (Bacillus) and the 
E. coli uidA gene, coding for p-glucuronidase (GUS). Vectors may be used in vitro, for example 
for the production of RNA or used to transfect or transform a host cell. 

For most filamentous fungi and yeast, the vector or expression construct is preferably 
integrated in the genome of the host cell in order to obtain stable transformants. However, for 
certain yeasts also suitable episomal vectors are available into which the expression construct can 
be incorporated for stable and high level expression, examples thereof include vectors derived 
from the 2\x and pKDl plasmids of Saccharomyces and Kluyveromyces, respectively, or vectors 
containing an AMA sequence (e.g. AMA1 from Aspergillus 3,20 ). In case the expression 
constructs are integrated in the host cells genome, the constructs are either integrated at random 
loci in the genome, or at predetermined target loci using homologous recombination, in which 
case the target loci preferably comprise a highly expressed gene. A highly expressed gene is a 
gene whose mRNA can make up at least 0.01% (w/w) of the total cellular mRNA, e.g. under 
induced conditions, or alternatively, a gene whose gene product can make up at least 0.2% (w/w) 
of the total cellular protein, or, in case of a secreted gene product, can be secreted to a level of at 
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least 0.05g/L A number of examples of suitable highly expressed genes are provided below. 

A vector or expression construct for a given host cell may comprise the following elements. , 
operably linked to each other, in a consecutive order from the 5 -end to 
3 f -end relative to the coding strand of the sequence encoding the polypeptide of the first 
invention: 

(1) a promoter sequence capable of directing transcription of the DNA sequence encoding 
. the polypeptide in the given host cell; 

(2) optionally, a signal sequence capable of directing secretion of the polypeptide from the 
given host cell into a culture medium; 

(3) a DNA sequence encoding a mature and preferably active form of the polypeptide; and 
preferably also 

(4) a transcription termination region (terminator) capable of terminating transcription 
downstream of the DNA sequence encoding the polypeptide. 

Downstream of the DNA sequence encoding the polypeptide there may be a 3' untranslated 
region containing one or more transcription termination sites (e.g. a terminator); The origin of the 
terminator is less critical. The terminator can e.g. be native to the DNA sequence encoding the 
polypeptide. However, preferably a yeast terminator is used in yeast host cells and a filamentous 
fungal terminator is used in filamentous fungal host cells. More preferably, the terminator is 
endogenous to the host cell (in which the DNA sequence encoding the polypeptide is to be 
expressed). I 1 

Enhanced expression of the polynucleotide encoding the polypeptide of the invention may 
also be achieved by the selection of heterologous regulatory regions, e.g. promoter, secretion 
leader and/or terminator regions, which may serve to increase expression and, if desired, 
secretion levels of the protein of interest from the expression host and/or to provide for the 
inducible control of the expression of the polypeptide of the invention. 

Aside from the promoter native to the gene encoding the polypeptide of the invention, other 
promoters may be used to direct expression of the polypeptide of the invention. The promoter 
may be selected for its efficiency in directing the expression of the polypeptide of the invention 
in the desired expression host. 

Promoters/enhancers and other expression regulation signals may be selected to be 
compatible with the host cell for which the expression vector is designed. For example 
prokary otic promoters may be used, in particular those suitable for use in E. coli strains. When 
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expression is carried out in mammalian cells, mammalian promoters may be used. . Tissues- 
specific promoters, for example hepatocyte cell-specific promoters, may also be used. Viral 
promoters may also be used, for example the Moloney murine leukaemia virus long terminal 
repeat (MMLV LTR), promoter rous sarcoma virus (RSV) LTR promoter, SV40 (e.g. large T 
antigen) promoter, human cytomegalovirus (CMV) IE promoter, herpes simplex virus promoters 
or adenovirus promoters, HSV promoters such as the HSV IE promoters), or HP V promoters, 
particularly the HPV upstream regulatory region (URR).. Yeast promoters include S. cerevisiae 
GAL4 and ADH promoters, the S. pombe nmt 1 and adh promoter. Mammalian promoters 
include the metallothionein promoter which may be induced in response to heavy metals such as 
cadmium and B-actin promoters. Tissue-specific promoters, in particular endothelial or neuronal 
cell specific promoters (for example the DDAHI and DDAHII promoters), are especially 
preferred. 

A variety of promoters 15,16 can be used that are capable of directing transcription in the host 
cells of the invention. Preferably the promoter sequence is derived from a highly expressed gene 
as previously defined. Examples of preferred highly expressed genes from which promoters are 
preferably derived and/or which are comprised in preferred predetermined target loci for 
integration of expression constructs, include but are not limited to genes encoding glycolytic 
enzymes such as triose-phosphate isomerases (TPI), glyceraldehyde-phosphate dehydrogenases 
(GAPDH), phosphoglycerate kinases (PGK), pyruvate kinases (PYK), alcohol dehydrogenases 
(ADH), as well as genes encoding amylases, glucoamylases, proteases, xylanases, 
cellobiohydrolases, B-galactosidases, alcohol (methanol) oxidases, elongation factors and 
ribosomal proteins. Specific examples of suitable highly expressed genes include e.g. the LAC4 
gene from Kluyveromyces sp., the methanol oxidase genes (A OX and MOX) from Hansenula and 
Pichia, respectively, the glucoamylase (glaA) genes from A.niger andA.awamori, the A.oryzae 
TAKA-amylase gene, the A.nidulans gpdA gene and the T.reesei cellobiohydrolase genes. 

Examples of strong constitutive and/or inducible promoters which are preferred for use in 
fungal expression hosts 13,16 are those which are obtainable from the fungal genes for xylanase 
(xlnA), phytase, ATP-synthetase, subunit 9 (o/zC), triose phosphate isomerase ((pi), alcohol 
dehydrogenase (AdhA), a-amylase (amy), amyloglucosidase (AG - from the glaA gene), 
acetamidase (amdS) and glyceraldehyde-3 -phosphate dehydrogenase (gpd) promoters. 

Examples of strong yeast promoters are those obtainable from the genes for alcohol 
dehydrogenase, lactase, 3 -phosphoglycerate kinase and triosephosphate isomerase. 
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Examples of strong bacterial promoters are the a-amylase and SPo2 promoters as well as 
promoters from extracellular protease genes. 

Promoters suitable for plant cells include napaline synthase (nos), octopine synthase (ocs), 
mannopine synthase (mas), ribulose small subunit (rubisco ssu), histone, rice actin, phaseolin, 
cauliflower mosaic virus (CMV) 35S and 19S and circovirus promoters. All these promoters are 
readily available in the art. 

The vector may further include sequences flanking the polynucleotide giving rise to RNA 
which comprise sequences homologous to eukaryotic genomic sequences, preferably mammalian 
genomic sequences, or viral genomic sequences. This will allow the introduction of the 
polynucleotides of the invention into the genome of eukaryotic cells or viruses by homologous 
recombination. In particular, a plasmid vector comprising the expression cassette flanked by 
viral sequences can be used to prepare a viral vector suitable for delivering the polynucleotides 
of the invention to a mammalian cell. Other examples of suitable viral vectors include herpes 
simplex viral vectors 18 ' 19 and retroviruses, including lentiviruses, adenoviruses, adeno-associated 
viruses and HPV viruses (such as HPV-16 or HPV-1 8). Gene transfer techniques using these 
viruses are known to those skilled in the art. Retrovirus vectors for example may be used to 
stably integrate the polynucleotide giving rise to the antisense RNA into the host genome. 
Replication-defective adenovirus vectors by contrast remain episomal and therefore allow 
transient expression. 

The vector may contain a polynucleotide of the invention oriented in an antisense direction 
to provide for the production of antisense RNA. This may be used to reduce, if desirable, the 
levels of expression of the polypeptide. 

Host cells and Expression 



In a further aspect the invention provides a process for preparing a polypeptide according to the 
invention which comprises cultivating a host cell (e.g. transformed or transfected with an 
expression vector as described above) under conditions to provide for expression (by the vector) 
of a coding sequence encoding the polypeptide, and optionally recovering the expressed 
polypeptide. Polynucleotides of the invention can be incorporated into a recombinant replicable 
vector, e.g. an expression vector. The vector may be used to replicate the nucleic acid in a 
compatible host cell. Thus in a further embodiment, the invention provides a method of making a 



'^0998A1_L> 



WO 01/70998 



22 



PCT/EPO 1/031 74 



polynucleotide of the invention by introducing a polynucleotide of the invention into a replicable 
vector, introducing the vector into a compatible host cell, and growing the host cell under 
conditions which bring about the replication of the vector. The vector may be recovered from 
the host cell. Suitable host cells include bacteria such as E. coli, yeast, mammalian cell lines and 
other eukaryotic cell lines, for example insect cells such as Sf9 cells and (e.g. filamentous) fungal 
cells. 

Preferably the polypeptide is produced as a secreted protein in which case the DNA 
sequence encoding a mature form of the polypeptide in the expression construct is operably 
linked to a DNA sequence encoding a signal sequence. Preferably the signal sequence is native 
(homologous) to the DNA sequence encoding the polypeptide. Alternatively the signal sequence 
is foreign (heterologous) to the DNA sequence encoding the polypeptide, in which case the 
signal sequence is preferably endogenous to the host cell in which the DNA sequence is 
expressed. Examples of suitable signal sequences for yeast host cells are the signal sequences 
derived from yeast a-factor genes. Similarly, a suitable signal sequence for filamentous fungal 
host cells is e.g. a signal sequence derived from a filamentous fungal amyloglucosidase (AG) 
gene, e.g. the Aniger glaA gene. This may be used in combination with the amyloglucosidase 
(also called (gluco)amylase) promoter itself, as well as in combination with other promoters. 
Hybrid signal sequences may also be used with the context of the present invention. 

Preferred heterologous secretion leader sequences are those originating from the fungal 
amyloglucosidase (AG) gene {glaA - both 18 and 24 amino acid versions e.g. from Aspergillus), 
the a-factor gene (yeasts e.g. Saccharomyces and Kluyveromyces) or the a-amylase gene 
{Bacillus). 

The vectors may be transformed or transfected into a suitable host cell as described above to 
provide for expression of a polypeptide of the invention. This process may comprise culturing a 
host cell transformed with an expression vector as described above under conditions to provide 
for expression by the vector of a coding sequence encoding the polypeptide. 

A further aspect of the invention thus provides host cells transformed or transfected with or 
comprising a polynucleotide or vector of the invention. Preferably the polynucleotide is carried 
in a vector for the replication and expression of the polynucleotide. The cells will be chosen to 
be compatible with the said vector and may for example be prokaryotic (for example bacterial), 
fungal, yeast or plant cells. . .. » 

A heterologous host may also be chosen wherein the polypeptide of the invention is 
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produced in a form which is substantially free from other cellulose-degrading enzymes. This 
may be achieved by choosing a host which does not normally produce such enzymes such as 
Kluyveromyces lactis. 

The invention encompasses processes for the production of the polypeptide of the invention 
by means of recombinant expression of a DNA sequence encoding the polypeptide. For this 
purpose the DNA sequence of the invention can be used for gene amplification and/or exchange 
of expression signals, such as promoters, secretion signal sequences, in order to allow economic 
production of the polypeptide in a suitable homologous or heterologous host cell. A homologous 
host cell is herein defined as a host cell which is of the same species or which is a variant within 
the same species as the species from which the DNA sequence is derived. 

Suitable host cells are preferably prokaryotic microorganisms such as bacteria, or more 
preferably eukaryotic organisms, for example fungi, such as yeasts or filamentous fungi, or plant 
cells. In general, yeast cells are preferred over fungal cells because they are easier to manipulate. 
However, some proteins are either poorly secreted from yeasts, or in some cases are not 
processed properly (e.g. hyperglycosylation in yeast). In these instances, a fungal host organism 
should be selected. 

The host cell may over-express the polypeptide, and techniques for engineering 
over-expression are well known 3 . The host may thus have two or more copies of the encoding 
polynucleotide (and the vector may thus have two or more copies accordingly). 

Bacteria from the genus Bacillus are very suitable as heterologous hosts because of their 
capability to secrete proteins into the culture medium. Other bacteria suitable as hosts are those 
from the genera Streptomyces and Pseudomonas. A preferred yeast host cell for the expression of 
the DNA sequence encoding the polypeptide is of the genera Saccharomyces, Kluyveromyces, 
Hansenula, Pichia, Yarrowia, and Schizosaccharomyces. More preferably a yeast host cell is 
selected from the group consisting of the species Saccharomyces cerevisiae, Kluyveromyces 
lactis (also known as Kluyveromyces marxianus var. lactis), Hansenula polymorpha, Pichia 

pastoris, Yarrowia lipolytica,and Schizosaccharomyces pombe. 7 

» 

Most preferred are, however, (e.g. filamentous) fungal host cells. Preferred filamentous 
fungal host cells are selected from the group consisting of the genera Aspergillus, Trichoderma, 
Fusarium, Disporotrichum, Penicillium, Acremonium, Neurospora, Thermoascus, 
Myceliophtora, Sporotrichum, Thielavia, and Talaromyces. More preferably a filamentous fungal 
host cell is of the species Aspergillus oyzae, Aspergillus sojae, Aspergillus nidulans, or a species 
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from the Aspergillus niger Group (as defined by Raper and Fennell, The Genus Aspergillus, The 
Williams & Wilkins Company, Baltimore, pp 293-344, 1965). These include but are not limited 
to Aspergillus niger, Aspergillus awamori, Aspergillus tubigensis, Aspergillus aculeatus, 
Aspergillus foetidus, Aspergillus nidulans, Aspergillus japonicus, Aspergillus oryzae and 
Aspergillus ficuum, and further consisting of the species Trichoderma reesei, Fusarium 
graminearum, Penicillium chrysogenum, Acremonium alabamense, Neurospora crassa, 
Myceliophtora thermophilum, Sporotrichum cellulophilum, Disporotrichum dimorphosporum 
and Thielavia terrestris. 

Examples of preferred expression hosts within the scope of the present invention are fungi 
such as Aspergillus species (described in EP-A- 184,438 and EP-A-284,603) and Trichoderma 
species; bacteria such as Bacillus species (described in EP-A- 134,048 and EP-A-253,455), e.g. 
Bacillus subtilis, Bacillus licheniformis, Bacillus amyloliquefaciens, Pseudomonas species; and 
yeasts such as Kluyveromyces species (described in EP-A-096,430 e.g. Kluyveromyces lactis in 
EP-A-30 1,670) and Saccharomyces species, e.g. Saccharomyces cerevisiae. 

Host cells according to the invention include plant cells, and the invention therefore extends 
to transgenic organisms, such as plants and parts thereof, which contain one or more cells of the 
invention. The cells may heterologously express the polypeptide of the invention or may 
heterologously contain one or more of the polynucleotides of the invention. The transgenic (or 
genetically modified) plant may therefore have inserted (e.g. stably) into its genome a sequence 
encoding one or more of the polypeptides of the invention. The transformation of plant cells can 
be performed using known techniques, for example using a Ti or a Ri plasmid from 
Agrobacterium tumefaciens. The plasmid (or vector) may thus contain sequences necessary to 
infect a plant, and derivatives of the Ti and/or Ri plasmids may be employed. 

Alternatively direct infection of a part of a plant, such as a leaf, root or stem can be effected. 
In this technique the plant to be infected can be wounded, for example by cutting the plant with 
a razor or puncturing the plant with a needle or rubbing the plant with an abrasive. The wound is 
then innoculated with the Agrobacterium. The plant or plant part can then be grown on a 
suitable culture medium and allowed to develop into a mature plant. Regeneration of 
transformed cells into genetically modified plants can be achieved by using known techniques, 
for example by selecting transformed shoots using an antibiotic and by sub-culturing the shoots 
on a medium containing the appropriate -nutrients, plant hormones and the like. 17 
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Culture of host cells and recombinant production 

The invention also includes cells that have been modified to express the fl-glucanase or a 
variant thereof. Such cells include transient, or preferably stable higher eukaryotic cell lines, 
such as mammalian cells or insect cells, lower eukaryotic cells, such as yeast and (e.g. 
filamentous) fungal cells or prokaryotic cells such as bacterial cells. 

It is also possible for the proteins of the invention to be transiently expressed in a cell line or 
on a membrane, such as for example in a baculovirus expression system. Such systems, which, 
are adapted to express the proteins according to the invention, are also included within the scope 
of the present invention. 

According to the present invention, the production of the polypeptide of the invention can be 
effected by the culturing of microbial expression hosts, which have been transformed with one or 
more polynucleotides of the present invention, in a conventional nutrient fermentation medium. 

The recombinant host cells according to the invention may be cultured using procedures 
known in the art. For each combination of a promoter and a host cell, culture condition are 
available which are conducive to the expression the DNA sequence encoding the polypeptide.. 
After reaching the desired cell density or titre of the polypeptide the culture is stopped and the 
polypeptide is recovered using known procedures. 

The fermentation medium can comprise a known culture medium containing a carbon 
source (e.g. glucose, maltose, molasses, cellulose, fl-glucan etc.), a nitrogen source (e.g. 
ammonium sulphate, ammonium nitrate, ammonium chloride, etc.), an organic nitrogen source 
(e.g. yeast extract, malt extract, peptone, etc.) and inorganic nutrient sources (e.g. phosphate, 
magnesium, potassium, zinc, iron, etc.). Optionally, an inducer (e.g. cellulose, fl-glucan, maltose 
or maltodextrin) may be included. 

The selection of the appropriate medium may be based on the choice of expression host 
and/or based on the regulatory requirements of the expression construct. Such media are known 

to those skilled in the art. The medium may, if desired, contain additional components favouring 

* 

the transformed expression hosts over other potentially contaminating microorganisms. 

The fermentation can be performed over a period of 0.5-30 days. It may be a batch, 
continuous or fed-batch process, suitably at a temperature in the range of between 0 and 45°C 
and, for example, at a pH between 2 and' 1 OrPrefenred fermentation conditions are a temperature 
in the range of between 20 and 37°C and/or apH between 3 and 9. The appropriate conditions 
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are usually selected based on the choice of the expression host and the protein to be expressed. 

After fermentation, if necessary, the cells can be removed from the fermentation broth by 
means of centrifugatibn or filtration. After fermentation has stopped or after removal of the cells, 
the polypeptide of the invention may then be recovered and, if desired, purified and isolated by 
conventional means. 

D. Uses of the polypeptides and methods of processing plant or cellulose-containing materials 

The polypeptides of the invention which possess B-glucanase (or cellulase) activity may be 

used to treat fungal or plant material including plant pulp and plant extracts. For example, they 

may be used to treat cereals, vegetables, fruits or extracts thereof. They may be used in washing 

compositions (liquid or solid, e.g. powder) and/or in detergent compositions. Conveniently the 

polypeptide of the invention is combined with suitable (solid or liquid) carriers or diluents 

including buffers to produce a composition/ enzyme preparation. The polypeptide may be 

attached to or mixed with a carrier, e.g. immobilized on a solid carrier. Thus the present 

invention provides in a further aspect a composition comprising a polypeptide of the invention. 

This may be in a form suitable for packaging, transport and/or storage, preferably where the 

glucanase activity is retained. Compositions include paste, liquid, emulsion, powder, flake, 

granulate, pellet or other extrudate form. 

The composition may further comprise additional ingredients such as one or more enzymes, 

i ■ !": 
for example pectinases, including an (e.g. endo)-arabinanase and rhamnogalacturoniase, other 

cellulases, xylanases, galactanases, mannanases and/or xyloglucanases. The polypeptide is 

typically stably formulated either in liquid or dry form. Typically, the product is made as a 

composition which will optionally include, for example, a stabilising buffer and/or preservative. 

The compositions may also include other enzymes capable of digesting plant material or 

cellulose, for example other cellulases, e.g. (B-D-)glucanases. For certain applications, 

immobilization of the enzyme on a solid matrix or incorporation on or into solid carrier particles 

may be preferred. The composition may also include a variety of other plant material-degrading 

enzymes, for example (other) cellulases and other pectinases. 

The polypeptides and compositions of the invention may therefore be used in a method of 

processing plant material to degrade or modify the cellulose constituents (e.g. B-D-glucan) of the 

cell walls of the plant or fungal material. Thus in a further aspect, the present invention provides 
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a method of degrading or modifying a plant cell or cellulose which method comprises contacting 
the plant, fungal cell or cellulose with a polypeptide or composition of the invention. 

The invention also provides a method of processing a plant material which method 
comprises contacting the plant material with a polypeptide or composition of the invention to 
degrade or modify the cellulose in the plant material. Preferably the plant material is a plant pulp 
or plant extract. 

In particular, the degradation preferably comprises cleaving of B-glucan subunits of a 
cellulose component of the plant cell wall. The plant material is preferably a cereal, vegetable, 
fruit or vegetable or fruit pulp or extract. The present invention further provides a processed 
plant material obtainable by contacting a plant material with a polypeptide or composition of the 
invention. 

The present invention also provides a method for reducing the viscosity of a plant extract 
which method comprises contacting the plant extract with a polypeptide or composition of the 
invention in an amount effective in degrading cellulose (or fl-D-glucan) contained in the plant 
extract. 

Plant and cellulose-containing materials include plant pulp, parts of plants and plant 
extracts. In the context of this invention an extract from a plant material is any substance which 
can be derived from plant material by extraction (mechanical and/or chemical), processing or by 
other separation techniques. The extract may be juice, nectar, base, or concentrates made 
thereof. The polypeptide may be used in (e.g. total) liquefaction and/or (advanced) maceration, 
for exampel in preparing (fruit) juices. The plant material may comprise or be derived from 
vegetables, e.g., carrots, celery, onions, legumes or leguminous plants (soy, soybean, peas) or 
fruit, e.g., pome or seed fruit (apples, pears, quince etc.), grapes, tomatoes, citrus (orange, lemon, 
lime, mandarin), melons, prunes, cherries, black currants, redcurrants, raspberries, strawberries, 
cranberries, pineapple and other tropical fruits, trees and parts thereof (e.g. pollen, from pine 
trees), or cereal (oats, barley, wheat, maize, rice). 

The polypeptides of the invention can thus be used to treat plant material including plant 
pulp and plant extracts. They may also be used to treat liquid or solid foodstuffs or edible 
foodstuff ingredients. 

Typically, the polypeptides of the invention are used as a composition/ enzyme preparation 
as described above. The composition will generally be added to plant pulp obtainable by, for 
example mechanical processing such as crushing or milling plant material. Incubation of the 
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composition with the plant will typically be carried out for at time of from 10 minutes to 5 hours, 
such as 30 minutes to 2 hours, preferably for about 1 hour. The processing temperature is 
preferably 10-55°C, e.g. from 15 to 25 °C, optimally about 20°C and one can use 10-3 OOg, 
preferably 30-70g, optimally about 50g of enzyme per ton of material to be treated. All the 
enzyme(s) or their compositions used may be added sequentially or at the same time to the plant 
pulp. Depending on the composition of the enzyme preparation the plant material may first be 
macerated (e.g. to a puree) or liquefied. Using the polypeptides of the invention processing 
parameters such as the yield of the extraction, viscosity of the extract and/or quality of the extract 
can be improved. 

Alternatively, or in addition to the above, a polypeptide of the invention may be added to the 
raw juice obtained from pressing or liquefying the plant pulp. Treatment of the raw juice will be 
carried out in a similar manner to the plant pulp in respect of dosage, temperature and holding 
time. Again, other enzymes such as those discussed previously may be included. Typical 
incubation conditions are as described in the previous paragraph. Once the raw juice has been 
incubated with the polypeptides of the invention, the juice is then centrifuged or (ultra) filtered to 
produce the final product. 

A composition containing a polypeptide of the invention may also be used during the 
preparation of fruit or vegetable purees. 

The polypeptide of the invention may also be used in brewing, wine making, distilling or 
baking. It may therefore used in the preparation of alcoholic beverages such as wine and beer, for 
example to improve the filterability or clarity of wine. In baking the polypeptide may improve 
the dough structure, modify its stickiness or suppleness, improve the loaf volume and/or crumb 
structure. 

Polypeptide of the invention can be used in brewing. In brewing, filtration problems can 
arise in mash-bills, which include over-modified malts, due to the present of B-glucans released 
at high temperatures. The reduction in the speed of wort filtration can be one of the main 
problems encountered. Additonal problems are colloidal stability and haze formation in finished 
beer. These may be caused by the same carbohydrate complexes, especially during high gravity 
brewing. The polypeptides of the invention may. be able to improve the filterability of the wort, 
for example after mashing. In this way, less wort liquor may be retained in the spent grains, and 
the yield of extract can be improved. A more efficient filtration can result in an increased 
efficiency of the brewhouse. Thus, in general, the polypeptides of the invention can be used to 
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improve the filterability or filtration rate of the (e.g. finished) beer. 

The polypeptides of the invention may also prevent or at least mitigate haze formation 
during beer manufacture. B-glucans found in the endosperm walls of cereals can be carried over 
into the finished beer. This may cause haze and loss of clarity. The polypeptides of the 
invention may prevent or at least mitigate accumulation of particles due to the hydrolysis of 
B-glucan. Hence the polypeptides of the invention may be used to increase the colloidal stability 
of the (e.g. finished) beer. 

The polypeptides find use in a number of industrial areas due to their glucanase activity. 
These can include not only alcohol production, but also in biomethanation, in bread making and 
in baking, in dental hygiene (for example dental or oral compositions), in the treatment or fabric, 
clothes or manufacture of leather, in the manufacture of paper, in pharmaceuticals, in tea, in the 
preparation or treatment of textiles, in detergent or washing compositions, and in the treatment of 
waste. One aspect of the invention is therefore a food or foodstuff comprising the polypeptide, 
such as an alcoholic beverage, bread, dough or tea. The polypeptide may be formulated into a 
suitable compositions for any of these uses. The polypeptide may be present in an aqueous 
composition (eg. hot water), preferably with one or more fungicides, in order to treat plant 
material (eg. bulbs), especially to control parasitic insects, mites and nematodes. 

As the polypeptides of the invention can degrade glucan it may be added to foods or • 
foodstuffs (for example by consumption by humans). It is known that soluble p-D-glucan is 
associated with the lowering of serum cholesterol and triglycerides, and therefore the 
polypeptides of the invention can be used to lower serum cholesterol and triglyceride levels in 
humans or animals, for example in hyperlipemic individuals. The invention therefore includes 
pharmaceutical and veterinary compositions that comprise the polypeptide of the invention and 
the pharmaceutically or veterinarily acceptable carrier. The polypeptides can thus be used in the 
manufacture of a medicament for preventing or treating hyperlipemia, or high serum cholesterol 
and triglyceride levels or disorders resulting therefrom. 

Polypeptides of the invention may also display anti-fungal activity. They may be able to 
degrade fungal cell walls, and thus can be employed for fungal cell wall lysis, in order to open 
the cells. This may release intracellular proteins. In such a way the polypeptides may be used to 
prepare yeast and/or fungal extracts. 
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E. Animal Feeds 

The invention additionally relates to foodstuffs or an animal feed composition comprising 

one or more polypeptides of the invention. The polypeptide may be present in the feed at a 

i . ■" 

concentration different from its natural concentration. Preferred amounts are from 0.1 to 100, 
such as 0.5 to 50, preferably 1 to 10, mg per kg feed. 

The invention also relates to a process for the preparation of an animal feed composition, the 
process comprising adding to one or more edible feed substance(s) or ingredient(s) a polypeptide 
of the invention. The polypeptides can be added to the animal feed composition separately from 
the feed substances or ingredients, individually or in combination with other feed additives. The 
polypeptide can be an integral part of one of the feed substances or ingredients. 

The polypeptides of the invention may also be added to animal feeds rich in cellulose to 
improve the breakdown of the plant cell wall leading to improved utilisation of the plant 
nutrients by the animal. The polypeptides of the invention may be added to the feed or silage if 
pre-soaking or wet diets are preferred. Advantageously, the polypeptides of the invention may 
continue to degrade cellulose in the feed in vivo. Fungal derived polypeptides of the invention in 
particular generally have lower pH optima and are capable of releasing important nutrients in 
such acidic environments as the stomach of an animal. The invention thus also contemplates 
(e.g. animal) feeds or foodstuffs comprising one or more polypeptides of the invention. 

The polypeptides of the invention may also be used during the production of milk 
substitutes (or replacers) from soy bean. These milk substitutes can be consumed by both 
humans and animals. A typical problem during the preparation of these milk substitutes is the 
high viscosity of the soy bean slurry, resulting in the need for an undesirable dilution of the 
slurry to a concentration of dry solids of 10 to 15%. An enzyme preparation containing a 
polypeptide of the invention can be added to, or during the processing of, the slurry, enabling 
processing at a higher concentration (typically 40 to 50%) dry solids. The' enzyme may also be 
used in the preparation of savoury product(s), e.g. from soy bean. 

The composition may additionally comprise (particularly when being formulated for use in 
animal feed) one or more ionophores, oxidising agents, surfactants, rumen protected amino acids, 
enzyme enhancers or enzymes which may be produced naturally in -the gastro-intestinal tract of 
the animals to be fed. 
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When added to feeds (including silage) for ruminants or monogastric animals (eg. poultry oi 
swine) the feeds may comprise cereals such as barley, wheat, maize, rye or oats or cereal 
by-products such as wheat bran or maize bran, or other plant materials such as soy beans and 
other legumes. The enzyme(s) may significantly improve the break-down of plant cell walls 
which leads to better utilisation of the plant nutrients by the animal. As a consequence, growth 
rate and/or feed conversion may be improved. The polypeptides of the invention may be added 
to the feed (directly or as an additive or ingredient) or treated cellulose (e.g. glucan) may be 
added instead. 

A particularly preferred method for the (exogenous) addition of the B-glucanase is to add the 
polypeptide of the invention as transgenic plant material and/or (e.g. transgenic) seed. The 
polypeptide may thus have been synthesized through heterologous gene expression, for example 
the gene encoding the desired enzyme may be cloned in to a plant expression vector, under 
control of the appropriate plant expression signals, e.g. a tissue specific promoter, such as a seed 
specific promoter. The expression vector containing the gene encoding the polypeptide can be 
subsequently transformed into plant cells, and transformed cells can be selected for regeneration 
into whole plants. The thus obtained transgenic plants can be grown and harvested, and those 
parts of the plants containing the heterologous (to the plant) polypeptide can be included in one 
of the compositions, either as such or after further processing. General methods for the 
(heterologous) expression of enzymes in (transgenic) plants, including methods for seed-specific 
expression of enzymes, are known 23 . The heterologous polypeptide may be contained in the seed 
of the transgenic plants or it may be contained in other plant parts such as roots, stems, leaves, 
wood, flowers, bark and/or fruit. The plant may be a monocot or a dicot. Suitable plants include 
cereals, such as oats, barley, wheat, maize and rice. Preferably the polynucleotide of the 
invention is stably incorporated into the plant genome. 

The addition of the polypeptide in the form of transgenic plant material, e.g." in transgenic 
seed may require the processing of the plant material so as to make the enzyme available, or at 
least improve its availability. Such processing techniques may include various mechanical (eg. 
milling and/or grinding) techniques or thermomechanical treatments such as extrusion or 
expansion. 

The present invention thus also relates to a process for promoting growth and/or feed 
conversion in a monogastric or non-ruminant animal, the process comprising feeding the animal 
polypeptide of the invention. Suitable animals include farm, monogastric and/or non-ruminant 
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animals such as pigs (or piglets), poultry (such as chickens, turkeys), calves or veal or aquatic 
(e.g. marine) animals (for example fish). 

Assays for Cellulose Degrading Enzymes 

Also within the present invention is the use of polypeptides according to the invention in 
screening methods to identify compounds that may act as agonists or antagonists which may 
modulate the B-glucanase. In general terms, such screening methods may involve contacting a 
polypeptide of the invention with a test compound and then measuring activity or incubating a 
polypeptide of the invention with a test substance and then detecting any modulation of 
B-glucanase activity. Agents which bind to the polypeptides of the present invention can also be 
identified by binding assays. 

Modulator activity can be determined by contacting cells expressing a polypeptide of the 
invention with a substance under investigation and by monitoring the effect mediated by the 
polypeptides. The cells expressing the polypeptide may be in vitro and preferably, the assay is 
carried out in vitro using cells expressing recombinant polypeptide. 

The assays and substrates described herein have allowed identification and confirmation of 
B-glucanase activity. However, these assays can be used to detect other cellulose degrading 
enzymes, whether or not they have fi-glucanase activity. The substrate that can be used for this 
assay can comprise B-glucan. 

Another aspect of the invention relates to an assay for identifying or detecting a polypeptide 
which is able to degrade cellulose. The activity may be a glucanase (e.g. B-glucanase) or 
cellulase or xyloglucanase. The. assay may comprise: 

(a) providing, as a substrate for a candidate compound (usually a polypeptide) the 
substrate described in the previous paragraph; and 

(b) contacting the substrate with the candidate compound, and detecting whether any 
carbohydrates are produced. 

The amount of these carbohydrates can be measured. If necessary, they can then be 
compared to the amount of the carbohydrates produced in a control experiment, in the absence of 
candidate compound. 

The above assays can be employedto identify modulators of the B-glucanase activity. 
Preferred features and characteristics of one aspect of the invention are applicable to another 
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aspect mutatis mutandis. 

The invention will now be described with reference to the following Examples which are 
intended to be illustrative only and not limiting, 

EXAMPLES 

General procedures 

Standard molecular cloning techniques such as DNA isolation, gel electrophoresis, 
enzymatic restriction modifications of nucleic acids, Southern analyses, E. coli transformation, 
colony lifts and filter hybridisations etc., were performed using standard techniques. 1 - 2 Synthetic 
oligo deoxynucleotides were obtained from ISOGEN Bioscience (Maarssen, The Netherlands). 
DNA sequence analyses were performed on an Applied Biosystems 373A DNA sequencer, 
according to the supplier's instructions. 

DNA labelling and hybridizations were conducted according to the ECL™ direct nucleic 
acid labeling and detection systems (Amersham LIFE SCIENCE, Little Chalfont, EnglandVor 1 
according to the standard radioactive labeling techniques. 1 

Example 1 : RNA isolation from T. emersonii and synthesis of cDNA / 

T. emersonii strain CBS 393.64 was fermented under cellulase-inducing conditions. At 
several time points mycelium and culture supernatants were harvested by filtration using 
Miracloth filtration wrap. Mycelium was washed extensively with demineralized water and 
squeezed between paper towels to remove excessive water. Mycelium from selected time points - 
(based on the cellulase measurements in culture supernatants) was frozen immediately in liquid 
nitrogen and ground to a fine powder using a mortar and pestle. The resulting powder was 
transferred to a sterile 50 ml tube and weighed: for every 1-1 .2 g of ground mycelium 10 ml 
TRIzol reagent (Gibco/BRL) was added (max. 25 ml per tube). The mycelial powder was 
immediately solubilised by vigorous mixing (vortexing, 1 min), followed by 5 minutes at room 
temperature incubation with occasional mixing. A 0.2 (original TRIzol) volume of chloroform 
(thus 2 ml for every 10 ml TRIzol used originally) was added, vortexed and left at room 
temperature for 10 min. Subsequently, the mixture was centrifuged at 4°C, 6000 g for 30 
minutes. The top aqueous phase was transferred to a fresh tube and total RNA was precipitated 
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by addition of a 0.5 (original TRIzol) volume of isopropyl alcohol (thus 5 ml of isopropyl 
alcohol for every 10 ml TRIzol used originally). After 10 minutes of precipitation at room 
temperature, the RNA was recovered by centrifugation for 30 minutes at 6000 g. On removal of 
supernatant the RNA pellet was rinsed with one volume of 70% ethanol. After removal of the 
ethanol, the RNA pellet was air dried. The dried RNA pellet was dissolved in 3 ml GTS (100 
mM Tris-Cl, pH 7.5, 4 M guanidium thiocyanate, 0.5 % sodium lauryl sarcosinate) buffer. 10 pi 
of RNA solution was used'to determine quality and concentration of nucleic acids. 

Northern analysis was performed 3 and the isolated RNA further purified. 1,3 . For isolation of 
mRNA a modified protocol (using gravity flow instead of centrifugation) of the PHARMACIA 
purification kit (Cat no. 27-9258-02) was used. 3 For cDNA synthesis the STRATAGENE cDNA 
Synthesis KIT was used according to the instructions of the manufacturer, except for a number of 
optimisations for using the pGBFIN vectors with major changes as previously described. 3 

The amount of cDNA synthesised was estimated by TCA precipitation and subsequently 
analysed via electrophoresis in alkaline agarose gels. 3 

Example 2: Preparation of a cDNA library from T. emersonii mRNA 

The cDNA pool obtained in Example 1 was blunted, ligated with adapters and restriction 
enzyme digested/ 

Cloning of the cDNA in the expression vector pGBFIN- 1 1 (see WO-99/326 1 7 for the 
construction of this vector) requires the presence of a EcoBI site on the 5 - and of an Xlw\ site on 
the 3'- end of the cDNA. Therefore, the first strand priming oligonucleotide and the adapter 
sequences used (Pharmacia) were chosen to meet the prerequisites set for the expression vector. 

The cDNAs obtained were separated via size fractionation through a SEPHAROSE CL-2B 
matrix, upon which size of the individual pools obtained were analysed via non-denaturing "gel 
electrophoresis. 3 Two pools of cDNAs, obtained via cut offs at 0.5 kb and 1 .0 kb respectively, 
were selected for construction of the a cDNA library in pGBFIN-1 1. For the pGBFIN- 1 1, a pool 
of completely double-digested (EcoRI-XhoI) pGBFIN- 1 1 vector (background ligation < 1%) was 
prepared. The selected cDNA pools were ligated into the pGBFIN- 1 1 vector and transformed 
into E. coli XLIO-Gold bacterial cells to generate two primary cDNA libraries. Transformation 
frequencies of the two pools were both > 1.0 x 10 6 . From a fraction of both the E. coli cDNA 
libraries, colonies were selected randomly and plasmid DNA was isolated. Analysis of this 
plasmid DNA demonstrated that both cDNA libraries had insert percentages between 90 and 
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95%. 

Furthermore, colony lifts were performed from a fraction of the library and the generated 
filters were subsequently hybridised with the T. emersonii gpdA gene, encoding the 
glyceraldehyde-3-phosphate dehydrogenase gene. Next, plasmid DNA was isolated and via 
restriction analysis it was demonstrated that all plasmid contained single inserts in the correct 
orientation. Sequencing of the 5' ends of the cDNAs within these T. emersonii gpdA containing 
plasmids demonstrated that > 85% was full length. 

Example 3: Transformation of the expression library to A.niger 

DNA was isolated from the E. coli cDNA library as described earlier. Total plasmid DNA 
was digested for 4 hours at 37°C with Notl to remove E. coli derived plasmid sequences. After 
. purification, the DNA was dissolved in sterile demineralised water. 

Multiple A. niger DS2978 transformations were performed 3 using 1.5 x 10 7 B 3.0 x 10 7 
protoplasts and 10 ug of plasmid DNA per transformation. Transformants were selected for the 
presence of the amdS selection marker by growth on acetamide as the sole N-source. Since both 
the amdS selection marker and the cDNA expression cassette are present on the integrating 
fragment growth on acetamide is indicative for the presence of a cDNA expression cassette. 

After approximately 7-10 days incubation at 30°C, 10,000 transformants were purified: the 
Aspergillus niger transformants were transferred robotically (FlexysJcolony picker automater) 
from the transformation plates towards 96 wells MTP Master Plates (MPs) containing 150 \il per 
well of solidified selective medium (SM) (per 1000ml: 0.52 g KC1, 1.52 g K 2 HP0 4 , 0.52 g 
MgS0 4 , 20g glucose, lg acetamide, 0.1M MES buffer, 15g agar, 1ml of trace element solution 
[trace elements solution (containing per 1 liter): 2.2g ZnS04/7H 2 0, l.lg H 3 B0 3 , 
0.5gFeSO 4 /7H 2 O, 0.17g CoCl 2 /6H 2 0, 0.16g CuS0 4 /5H 2 0, 0.15gNaMoO 4 /2H 2 O, 5.0g EDTA, pH 
6.5] pH 5.5. The transformants were grown on SM for 5 days at 34°C. The thus generated set of 
MPs was used to 1) inoculate MTPs for growth and subsequent enzyme detection and 2) backup 
plates (BPs) of the cDNA library which were stored at -80°C. 

Example 4: Analysis of the T. emersonii expression library 

5 days-old grown MPs were used as replication template and replica plated on fresh 
selective medium (SM) plates, containing per litre: 0.52 g KC1, 1.52 g K 2 HP0 4 , 0.52 g MgS0 4 , 
20g glucose, lg acetamide, 0.1M MES buffer, 15g agar 1ml of trace element solution [trace 
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elements solution (per 1 litre): 2.2g ZnS0 4 /7H 2 0, l.lg H 3 B0 3 , 0.5gFeSO 4 /7H 2 O, 0.1 7g 
CoCl 2 /6H 2 0, 0.16g CuS0 4 /5H 2 0, 0.15g NaMo0 4 /2H 2 0, 5.0g EDTA, pH 6.5] pH 5.5. 

Once inoculated the plates were incubated at 34°C for 48h. Subsequently the plates were 
filled up with a top-agar-containing carboxymethylcellulose (CMC) (5g agarose, 0.5g CMC 
(Sigma ref C4888) prepared in 1000ml of 50 mM phosphate buffer pH 7). Once the top agar 
solidified, the plates were left at 65°C for 4 hours. For the activity visualization, the plates were 
stained with a Congo red solution (10 g Congo red in 1000ml phosphate buffer pH7) for 15 
minutes. The staining solution was discarded and the plates were washed with 1M NaCl (58.44g 
in 1 litre of distilled water). This latter step was repeated twice. Positive clones appeared by 
forming a pale clearance halo on a red background. 

The positive cellulase clones from this first screen (demonstrating a clear halo after congo 
red screening) were re-inoculated on fresh SM medium and grown for 5 days at 34°C. The thus- 
obtained template plate was then replicated on selective medium and on selective medium 
containing 0.075% (w/v) of AZCL-cellulose (Megazyme catalogue ref. I-AZCEL). The SM 
plates were treated and scored as described previously (growth at 34°C and subsequent screening 
via cellulose-containing overlay and Congo red straining) whereas the SM-AZCEL-cellulose 
plates were incubated at 34°C for 48 h and then further incubated at 65°C for 8 h. The SM- 
AZCL-cellulose plates were scored before and after the high temperature incubation. The 
positive cellulase clones resulted in a diffuse blue halo. 

Finally, 20 positive cellulase clones were identified. Cellulase producing Aspergillus 
transformants, as identified in the xylanase plate assay, were grown in shake flask fermentation. 3 
Medium samples were taken after 5 days of fermentation and analysed for cellulase activity as 
described later. 

Example 5: Genetic analysis of positive transformants 

Positive (re-confirmed) transformants identified were grown on liquid medium, the 
mycelium was harvested and total (chromosomal) DNA was isolated using the Puregene 
Isolation System (Biozym B.V.) for DNA isolation from filamentous fungi. DNA Isolation and 
purification were performed according to the suppliers' protocol, but slightly modified: protein 
precipitation steps 3 and 4 were repeated^ 

Chromosomal DNA was used as a" template in a PCR reaction using primers 12207 (SEQ ID 
No. 8) and 1 1937 (SEQ ID No. 7) to amplify the insert(s) present in the expression cassette 
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integrated into the chromosomal DNA. 

Direct PCRs on transformants were performed according to an adapted version of a known 
protocol 4 where the mycelium obtained was subsequently treated with GlucanexJ (Novo 
Nordisk) at 5 mg/ml concentrations instead of the NOVOzyme. 

PCR reactions contained eLONGase™ B buffer (Life Technologies, Breda, The 
Netherlands), dNTPs (200 uM of each), 1 ul eLONGase™ Enzyme Mix, 1-5 ul template, and 
10-30 pmol of each oligo, in a final volume of 50 ul. The optimal amount of oligos was 
determined experimentally for each purchased batch. On average, 10 to 30 pmol was used. 
Reactions were performed with the following cycle conditions: lx (2 min )94°C, 35x (1 min 
94°C, 1 min 55°C, 6 min 72°C), lx (7 min 72°C). Samples were loaded on agarose gels for 
analyses of PCR products. 

The thus obtained PCR product was subcloned in the E. coli pcr2. 1 cloning vector 
(Invitrogen, according to the supplier's instructions), resulting in plasmid pGBCEA-1. 

The subcloned PCR product was sequenced. The resulting nucleotide sequence of the 
coding region is depicted in SEQ ID NO 1 and the deduced amino acid sequence of the protein in 
SEQ ID NO 2. This protein has been named CEA. 



Example 6 : Viscometric screening of a cDNA Expression Library using the Hamilton Visco- 
robot. 

Viscometric measurements 

Capillary viscometers are the most commonly used type of viscometers used for the 
measurements of viscous liquids. In general, the liquid of interest is made to flow through a 
capillary tube under a known pressure difference. Then the rate of flow is measured, usually by 
noting the time taken for a given volume of liquid to pass a graduation mark. Other types of 
capillary viscometers force the liquid through a capillary at a predetermined rate of flow, and the 
pressure drop thereby produced across the capillary is measured in order to determine the liquids 
viscosity. Controlled enzymatic degradation of polymeric viscous solutions may be used to 
determine the enzyme activity. Providing the numeric relationship between the viscosity and the 
concentration of the liquid is known, kinetic parameters such as the Michaelis-Menten constant 
can be estimated. 
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I. The Detection System 

(a) The setup of the Hamilton viscorobot 

The Hamilton pipetting robot (Hamilton Workstation Microlab 2200, Hamilton 
Company, Reno, USA) was controlled by a software program called Eclipse (Hamilton 
Company). This software enables the user to direct the probe-arm to a certain position within the 
Hamilton workspace. A standard visco-assay Eclipse program was developed. This program 
works on 96-welled MTPs, where each well can be addressed individually. The syringes are 
used to aspirate or dispense sample liquid at an Eclipse-specified position through the needle 
orifice. Both the aspiration and dispension velocity (in sec/ml) as well as the aspiration and 
dispension volumes (in jj.1) were specified at will. By implementing rack definitions, multiple 
reagent containers can be accessed in an efficient and timesaving manner. The carry-over of 
sample liquid from one pipetting step to the next was avoided by instructing Eclipse to rinse the 
tubing with system fluid. The aspiration velocity of 10 sec/ml (sometimes 15 sec/ml) was 
suitable to distinguish most viscosities by looking at the peak heights. Usually 200 - 250pi of 
sample liquid was aspirated and dispensed. 

(b) Setup of the Hamilton 

. Aspiration of sample liquid produces a pressure drop throughout the water-filled 
tubing. The magnitude of the pressure drop depends on both the aspiration (or plunger 
'' movement) velocity and on the viscosity of the sample liquid. The greatest pressure drop is 
created in the needle tip, since the radius is smallest (about 0.3 mm). Aspiration of sample 
liquid causes pressure changes, which are relayed from the T-junction to the pressure 
transducer. The transducer (Depex B.V., de Bilt, The Netherlands) converts the magnitude of 
the underpressure into an electric current. The electric current is read by the Datataker once 
per second, and subsequently stored in the Datataker's memory in a digital format. A 
computer that is connected to the Datataker enables (via the Delogger software) the download 
of the Datataker memory into files. They in turn can be read and analysed by common 
spreadsheet programs (such as Microsoft Excel). Since the pressure was measured with time, 
the final output can be visualised by plotting the magnitude of the underpressure (in mV) 
against the time (in seconds). Upon displacement of the T-junction to a position right behind 
the needle, the pressure signal is less dependent on the aspiration volume. 
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(c) Preparation of calibration curves allowed the determination of the viscosity of any liquid. . 
According to Poisseuille's Law, the measured pressure drop should be directly 

proportional to the viscosity, since 

P=(mV)F r = ri — - where k = — 

Ttr k 81Q 

i ' 

With this formula it possible to relate the milli-Volt underpressure output to the unknown 
viscosity of the sample liquid. This was done by aspirating a defined constant volume of 
calibration liquid (usually glycerol, as it covers a wide range of viscosities and behaves in a 
Newtonian manner) at various concentrations. The underpressure thus created relate to the mV 
output by an unknown factor F. Since the absolute viscosities of various glycerol concentrations 
are known, a plot of the mV underpressure output for various known glycerol concentrations 
against the known absolute viscosities of the same glycerol concentrations produces a straight 
line through the origin (since P = (mV)*F = r|/k). 

(d) Eclipse program used for Full-Scale Screening. 

A single measuring cycle consists of the aspiration of 250|ul sample and supernatant 
substrate mix) at an aspiration velocity of lOsec/ml. Prior to the aspiration of sample fluid, 40jnl 
of air was aspirated to separate system from sample fluid. Once the aspiration of sample liquid 
was complete, it is followed by dispension of 290^1 of sample and air. This measuring cycle was * 
repeated six times, and followed by a 5ml washing step in order to clean the tubing. 

II. Assay Development 

(e) Establishing the optimum substrate mixture of pectin andxylanfor screening. 

Using more than one viscous substrate at a time had several advantages. One can screen 
for different types of enzymes at the same time. This saved a large amount of effort and time 
while less substrate was required. In addition to oat spelt xylan it was decided to use pectin as 
the second substrate. A 1:1 solution of 1% pectin and 7% oat spelt xylan seemed to be the most 
suitable for screening. Since there were proportionally few positive clones in the library, most 
peaks had an approximate height of 435mVlJn case of a positive xylanase sample, all or most of 
the xylan was be degraded so that only 0.5% pectin was left. Hence, a positive xylanase clone 
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produced a peak height of 280mV. In case of a pectin degrading clone, the peak height was 
220mV. 

(f) Determination of the minimal xylanase activity necessary to be detected with the 
Hamilton viscorobot 

The screening was carried out with very small volumes of both substrate and supernatant, 
since a single MTP well holds at most 360|al. Further, the more supernatant added, the more the 
substrate was diluted, and the more the initial viscosity is reduced. 250|il of 6% oat spelt xylan 
was dispensed into the wells of an MTP. Dilution of a series of a reference xylanase 
(A. tubigensis xylanase with an activity of 685,400 EXU/g, where 1 EXU =4.53 mole reducing 
sugars /min/g) were prepared and 30\xl of each dilution was added to the substrate. After 
incubation for 24 hours at 50°C, the viscosity of each sample was measured. The lowest 
detectable enzyme concentration corresponds with 53.6ng/ml or 0.0367 EXU/ml. Hence, the 
addition of 20jj1 of supernatant to 300jj,1 substrate will enable the detection of extremely low 
enzyme activities. 

(g) Setting up the screening for the identification of thermostable enzymes. 

In order to pick up only thermostable enzymes from the library and in order to avoid 
interference of host-Arager enzymes activity, the plates with the clones containing the 
thermostable candidate enzymes were subjected to heat treatment. The system was validated 
using empty host strains, host strains expressing thermo-labile enzymes and host strains 
expressing thermostable enzymes. After growth of the strains and production of enzyme the 
MTP plates were heated at 72°C for 30 minutes. Subsequently 20(jJ of supernatant was added to 
300|al of 6% oat spelt xylan. Along with negative controls (addition of 20|ul of water), the plates 
containing the samples were sealed with sticky tape and incubated at 60°C in the oven. The high 
temperature may increase any thermostable enzyme activity but destroys both the background 
host- A.niger enzyme interfering activity and the non-thermostable enzyme activities expressed 
by the library. After 20 hours of incubation, no notable peak height- decrease was found for the 
thermo-labile xylanase clones, indicating that host-xylanase activity has been thoroughly 
inactivated. In addition clones of the thermolabile xylanase enzyme xynB of Aspergillus niger 
were included as a control. In this case no residual activity could be detected, while on the other 
. hand the heat resistant xylanase which was included as a control was still active. Heat 
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inactivation for 30 minutes at 72°C and sample incubation times of 24 hours or less enabled us to 
specifically detect thermostable T.emersonii xylanases. 

III. Screening of the cDNA expression library using the Hamilton Visco-robot 
(h) Replication of the cDNA expression library and expression of the library. 

One replication cycle involves the two-fold inoculation of both selective medium (SM) 
(containing per litre: 0.52g KC1, 1.52g K 2 HP0 4 , 0.52g MgS0 4 , 20g glucose, lg acetamide, 0.1M 
MES buffer, 15g agar 1ml of trace element solution [trace elements solution (per litre): 2.2g 
ZnS0 4 /7H 2 0, l.lg H3BO3, 0.5gFeSO 4 /7H 2 0, 0.1 7g CoCl 2 /6H 2 0, 0.1 6g CuS0 4 /5H 2 0, 0.1 5g 
NaMo0 4 /2H 2 0, 5.0g EDTA, pH 6.5] pH 5.5) and liquid growth medium (GM) (containing, per 
litre, 70g glucose, 25g casein hydrolysate, 12.5g yeast extract, lg KH 2 P0 4 , 0.5g K 2 S0 4 , 2g 
MgS0 4 , 0.03g ZnCl 2 , 0.02g CaCl 2 , O.Olg MnS0 4 , 0.3g FeS0 4 , pH 5.6). The library was stored in 
standard MTPs, where sporulated recombinant A.niger colonies grew in each well on SM in the 
presence of 10% glycerol. During storage, these plates (master plates, MPs) were kept in a 
frozen state (-80°C). Prior to the replication of the MPs, they were defrosted for one hour in a 
sterilised fume cupboard to avoid microbial contamination. Replication of the MPs was 
performed with the PBA Flexys-Colony Replicator. 5 days-old grown Master Plates were used as 
replication template and replica plated in 96-welled MTPs filled with liquid growth medium 
(GM). The freshly inoculated SM agar plates were placed in an incubator at 32°C and 
subsequently stored at -80°C after addition of 1 50pJ of 1 0% glycerol per well. The GM 
production plates were incubated in a water-saturated Tomtec Quadrastor Shaker 96000 (32°C). 
The plates showed growing mycelium after 2-3 days. The GM plates were incubated for 6 days. 

(i) Sampling of the supernatants after the growth is complete. 

After 6 days of growth, the remaining GM liquid growth medium (containing 
extracellular expression products) was extracted and transferred into fresh MTPs. The 
supernatants were transferred into fresh MTPs using a 4-channel Tecan pipetting robot. The 
average volume of supernatant recovered was between 120 andl40jil. These supernatant plates 
were frozen and were subsequently assayed for xylanase activity. 

G) Preparations for the full-scale screening. 

The supernatant plates from the expression library were defrosted and placed in a closed 
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water bath at 72°C for 35-40 minutes. Using a 12-channel pipette and commercial pre-prepared 
pipette tips in disposable MTP-format racks (supplied by Eppendorf), 20fj,l of each supernatant 
was transferred to the substrate-mixture containing MTP which had been heated to 60°C before 
adding the supernatants. The substrate mixture consisted of a 1 : 1 mixture of 1% pectin (see (k) 
below) and 7% oat spelt xylan (see (1) below), with a final pH of 4.12. The assay MTPs were 
filled with 310ml of substrate/well. Mixing of supernatants with the substrate mixture was 
achieved by stirring the pipette-tips in the sample wells directly after dispensing the supernatants. 
Subsequently the plates were placed in a 60°C oven for an incubation period of 18-24 hours. 
After the incubation, the MTPs were allowed to cool down and screened for a viscosity decrease 
using the Hamilton visco-robot (Hamilton Company, Reno, USA). 

(k) Preparation ofl% pectin, pH 4. 1 

A citric acid-phosphate buffer (Mc Ilvaine buffer) was prepared by adding 0.2M 
Na 2 HP0 4 solution to 250 ml of 0.1M citric acid solution until a pH of 5.5 was obtained. This was 
made up to one litre with dstilled water, and the pH was readjusted if necessary. A 0.5% pectin 
solution was made by slowly adding 0.5g of highly methylated pectin (type Ruban Brun) to a 
flask containing 50ml of the above Mcllvaine buffer and 25ml distilled water at 60°C. Vigorous 
stirring ensured that the pectin dissolved well. This was made up to 100ml, and the pH checked 
again. The pectin used here lowered the pH of the solution, so under these conditions the 
solution had a pH of 5. 1 . If the pectin solution was cloudy, it was useful to centrifuge the 
solution at 15g for 15 minutes in order to remove any undissolved particles. 

(1) Preparation of 7? / o alkaline-treated oat spelt xylan, pH 4. L 

20ml of 2M NaOH was heated up to 60°C in a 1 50ml beaker. In a separate beaker 7g of 
oat spelt xylan (from Sigma company) was added to 20ml of water, so that a bright brown, 
doughy mass formed. If necessary, more water was added , but no more than 30 ml altogether. 
Using a steel spoon, this water-oat spelt xylan mass was slowly added to the hot NaOH solution. 
A powerful stirring device was required to dissolve the xylan in the hydroxide solution, thereby 
keeping the temperature at a steady 60°C. Once all the xylan had dissolved, the solution turned 
dark-brown and resembled a very viscous, clear syrup. Then the rest of the water was added so 
that an overall amount of 50ml had been* added. The solution was allowed to cool down and 4N 
HC1 added until the desired pH was achieved (usually pH 4.1-4.2). The solution was made up to 
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100ml and the pH readjusted if necessary. Centrifugation at 20,000g for 15 minutes at 10°C 
produced a clear yellowish supernatant (whose viscosity depended on the initial amount of oat 
spelt xylan added). " 

(m) The screening of the library 

The outcome of the screening of the Talaromyces emersonii library cloned into A.niger 
was 1 19 graphs corresponding with the 1 19 MTPs tested. Each graph showed the viscosity of 
the 96 wells represented as 96 peaks which measure the underpressure per well in mV. The 
graphs were analyzed for low peaks, which indicated reduced viscosity. Peaks that are lower than 
the average peak height were selected for retesting. Some plates produced very little variation, so 
that selection of putative positive clones for retesting was easy. Other plates showed a great 
amount of variation, hence often more than 5 or 6 putative clones were selected for retesting. To 
have a positive control, after the incubation lOul of an endoxylanase was added to a random 
plate at a fixed position. From those positive controls, it was found that if there is xylanase 
activity, we should expect a peak height between 240 and 280mV. 

The plates of the Talaromyces library which were tested also contained five confirmed • 
thermostable xylanase clones which were found before using a dye detection assay based on 
solubilisation of dyes which were chemically attached to the insoluble polymeric substrate. 
Without exception, those five clones were independently found with the viscometric assay 
already in the primary screening round. All peaks for the known clones were on a level much 
lower than the rest of the peaks (at 250mV). Overall, 118 culture wells were selected for 
retesting, excluding those with previously found xylanase clones. 

(n) Retesting of 118 putative clones. 

Retesting was based on the viscometric assay according to the principle used with the 
full-scale screening. This time, however, a greater assay volume was utilised, since a clearer 
distinction between thermostable enzyme activity and irregular low peaks could be made. Two 
large MTPs (2ml/well) were filled with 1 .2ml of 9% oat spelt xylan. To the first two wells of 
each row 50ul of water was added (negative control). To the last well of each row, 50ul of an 
reference endoxylanase solution was added in order to have a positive control, too. Wells 3 to 1 1 
of each row were used for the retesting of putative xylanase clones (addition of 50ul 
supernatant). After mixing and incubation for 24 hours at 60°C, the viscosities were measured 
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with a specifically designed Eclipse program (Aspiration volume: 800pi, Aspiration velocity: 
lOsec/ml). 

Several positive clones were identified since their peaks were exactly on the same level as 
the positive controls, indicating complete degradation of xylan. In total thirteen putative strains 
were identified in the viscoscreen retest. The cDNA inserts were directly cloned in the pCR2.1 
vector via PCR amplification executed on chromosomal DNA. From 12 library strains sixteen 
cDNA inserts were obtained, using pwo-polymerase and primer set 2 (1 1937/12207, SEQ ID 
Nos. 7 and 8). The orientation of the cDNA insert in the pCR2.1 vector was determined by Xhol 
digestion, which was located in the vector and at the 3 5 of the cDNA. DNA sequence analysis 
was executed on the 5' end of the cDNA inserts. 

The retesting for possible pectin degrading enzymes was performed in a similar fashion 
as for the xylanase, except that 1% pectin was used instead of 9% oat spelt xylan. Only ten 
possible clones were retested, which were those that produced very low peaks in the primary 
screening round but did not show any activity during the xylanase retesting. Reproducibly one 
clone produced a pectin-degrading enzyme. The measured pressure value of 200mV corresponds 
to the viscosity of pure water- Since the pectin used was not fully -methylated, it might have been 
degraded by a thermostable polygalacturonase, pectate lyase or pectin lyase. 

(o) Benefits of Viscoscreen 

The development of a viscometric screening method to identify thermostable enzymes 
was facilitated by three key factors. Firstly, the possibility of thermo-inactivation of native host- 
xylanase activity allowed a much faster development of appropriate reaction conditions. 
Secondly, the fact that when clones with known thermo-labile enzymes were tested the 
thermoinactivation destroyed all activity so that only the more thermostable enzymes were 
picked up. In the third place the elevated temperatures increased the enzymatic activities so that 
the screen became even more sensitive. On Oat Spelt Xylan (Sigma) eleven clones were 
identified which were able to reduce viscosity significantly and on pectin one definite pectin- 
degrading clone was identified. Additionally, five thermostable xylanase clones which were 
previously identified were independently re-discovered. 
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Example 7: C haracterization of first Talaromvces emersonii thermostable fl- p lucanase fCEAl 
from A. riiger transformant 

Activity assays and definitions 

Cellulose PAHBAH Unit (CPU) definition 

One unit of cellulase activity is defined as the amount of enzyme required to release one 
umol reducing sugars produced per minute at pH 5.0 and 60°C at a substrate concentration of 5% 
Carboxy Methyl Cellulose (CMC), using a calibration curve of glucose. 

Enzyme activity according to the CPU method was measured by detecting reducing 
sugars using 4-hydroxybenzoic acid hydrazide (PAHBAH). The assay is based on Lever, 
M., Powell, J.C., Killip, M., Small, C.W. (1973) J. Lab. Clin. Med. 82, 649-655 with some 
modifications. The modification is to the PAHBAH reagent as follows : 0.05 M trisodium 
citrate, 0.1 M Na 2 S0 3 , 0.02M CaCl 2 , 0.5M NaOH and 0.1M p-hydroxybenzoic acid hydrazide 
(PAHBAH). Final pH was 12. The reagent containing PAHBAH in alkaline solution, stored at - 
room temperature, should be used within one day. Glucose was used as the reference reducing 
sugar (calibration curve). For the calibration curve of this assay, the final concentrations of 
glucose were between 0-300 mM. The CPU activity was assayed by mixing 100 ul of enzyme 
solution with 400 ul of 5% CMC in 0.1 M sodium acetate buffer (pH 5.0). Eppendorf cups with 
the substrate (CMC) were preincubated for 5 minutes at 60°C. The reaction was started by 
adding the enzyme solution. After .15 minutes the reaction was stopped by adding 1 .0 ml 
PAHBAH-reagent. The Eppendorf cups were heated for 5 minutes at 100°C and then cooled on 
ice. Samples were centrifuged at the appropriate speed in order to spin down any solid materials 
(1 minute at full speed in a Beckman Microfuge E). The absorbance was measured at 420 nm. 
A blank was prepared by adding 100 ul 0.1M sodium acetate buffer instead of enzyme solution. 

Beta Glucanase Unit (BGU) definition 

Beta Glucanase Unit is the activity required to liberate 0.258 umol reducing sugars 
(measured as glucose equivalents) per minute at pH 3.5 and 40°C, at a substrate concentration of 
0.5% (3-glucan from barley. 
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So in addition to the above determination of cellulase (CPU) activity, a more specific 
assay was earned out for detecting fl-glucanase activity. The principle of the assay is the 
velocity at which the viscosity decreases of a solution of barley B-glucan medium viscosity 
(Megazyme, Australia, 2/1 1 Ponderosa Parade, Warriewood NSW 2101) upon addition of a 
certain amount of enzyme. Barley B-glucan, medium viscosity (Megazyme Australia), was 
dissolved in 0.425M sodium citrate buffer pH 3.5 to a concentration of 6.25mg/ml. . The 
substrate was incubated at 40°C for 10 minutes. Subsequently a small amount of enzyme (in the 
range 0.005-0.062 Units/ml) was added and the reaction allowed to proceed. At 60 minutes 
reaction time the viscosity of the sample was determined relative to a reference sample which 
was incubated with a standard endo-glucanase of known enzymatic activity. Absolute activities 
for the standard were determined by a reducing sugar method using Fe-III-hexacyanide and 4.76 
mg/ml barley B-glucan as initial substrate concentration. 

Definition of Endo Xylanase Unit (EXU) 

The unit of xylanase activity (EXU) is defined as the amount of enzyme (endol endo- 1,4- (3- 
xylanase from Asp. niger ,. as decribed in EP-A-0 ,463,706 (Gist-brocades B.V.)) that liberates 
4.53(amol reducing sugars (measured as xylose equivalents) per minute under assay conditions. 
The assay conditions comprise: 5mg/ml arabinoxylan from wheat flour (Mega2yme, Australia, 
2/11 Ponderosa Parade, Warriewood NSW 2101) in lOOmM sodium citrate buffer (pH 3.5), 
temperature 40°C, at a reaction time of 60 minutes. Reactions were stopped by adding 1M 
NaOH. Detection was done colorimetrically at 420nm after incubating the samples with Fe-III- 
hexacyanide for 15 minutes in boiling water. The hexacyanoferrate reagent was made up by 
dissolving 1.17g KFe(CN) and 19. 5g anhydrous sodium carbonate in 1 litre of water. 

In addition to the above absolute determination of xylanase activity, a relative method 
was used that followed the decrease in viscosity of a solution of wheat arabinoxylan (Megazyme, 
Australia, 2/1 1 Ponderosa Parade, Warriewood NSW 2101) upon addition of a certain amount of 
enzyme. Wheat arabinoxylan was dissolved in 0.425M sodium citrate buffer (pH 3.5) to a 
concentration of 8.3mg/ml. The substrate was incubated at 55°C for 10 minutes. Subsequently a 
small amount of enzyme (in the range 0.01-0.05 Units/ml) was added and the reaction allowed to 
proceed. After 60 minutes reaction time the viscosity of the sample was determined relative to a 
reference which was incubated with a Aspergillus niger endo-xylanase standard of known EXU 
activity (EP-A-0,463,706). Absolute activities in EXU for the standard were determined by 
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reducing sugar method using Fe-III-hexacyanide as described above. Viscosity was determined 
manually using a Haake falling ball viscosity apparatus. 

Definition of Cellulase Unit (CXU) 

The unit of beta-glucanase activity (BGU) is defined as the amount of cellulase that 
hydrolyses in one hour a number of glycosidic bonds equivalent to the production of 0.5 mg 
glucose under the conditions of the assay. The assay conditions comprise: 9mg/ml 
carboxymethylcellulose in 5mM sodium acetate buffer (pH=4.6), temperature 37°C. Liberated 
gluxose was determined by reducing suger method using the DNS reagent. 

In addition to the above absolute determination of cellulase activity, a relative method 
was used that followed the decrease in viscosity of a solution of carboxymethylcellulose upon 
addition of a certain amount of enzyme. Carboxymethylcellulose was dissolved in 5mM sodium 
citrate buffer (pH=4.6) to a concentration that depends on the viscosity of the batch. The 
substrate was incubated at 37°C for 10 minutes. Subsequently a small amount of enzyme was 
added and the reaction allowed to proceed. After 60 minutes reaction time the viscosity of the 
sample was determined relative to a reference which was incubated with a standard of known 
CXU activity. : 

Activity wrtpH 

The T.emersonii B-glucanase was produced by the appropriate A.niger transformant(s) in 
shake flasks as described in Example 3 after growth mycelium was removed by filtration. The 
filtrate was used for this experiment. The activity of the filtrate was measured at different pH 
values at a fixed temperature of 60°C. The activity was measured according to CPU method. - 
Instead of using the fixed pH of pH 5.0, the CMC substrate was diluted with a citrate buffer of 
the appropriate pH in order to obtain the pH of measurement. The experiment was repeated 
twice and the results are shown below in Table 1 . The optimal pH of the enzyme was found to 
be between pH 4.5 and 5.0, at about pH 4.8. 
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Table 1 : pH profile at 60°C 



pH 


activity (uM glucose/15 minutes) at 60°C [ 
(duplicate measurements) 


3 


27.2 


27.7 


3.5 


66.6 


64.2 i 


4 


109.3 


102.6 


4.5 


133.8 


120.6 


5 


121.3 


135.5 


5.5 


116.4 


107.4 


6 


68.7 


69.3 



Activity wrt temperature 

The activity of this fl-glucanase was then measured at different temperatures. Activity 
measurements were carried out according to the CPU method at pH 4.0. Instead of incubating at 
a fixed temperature of 60°C, the incubations were performed at various temperatures. The 
experiment was performed twice and the results are shown below in Table 2. The temperature 
optimum was found to be between 80°C and 85°C. The T opt seems to be at about 84°C (although 
it can be S2°C, 83 °C or 85°C, depending on how the lines are drawn and interpolated between 
data points). 



Table 2: Temperature profile at pH 4 



Temperature 
(°C) 


activity (uM glucose/ 15 minutes): 
(duplicate measurements) 


60 


87.8 


102.1 


70 


156.4 


154.6 


80 


208.5 


213.1 


90 


185.4 


176.4 



Activity wrt other enzymes 



In addition the activity at 30-60°C was measured using the BGU method for the 
B-glucanase and compared with a commercially available T.reesei cell free broth having cellulase 
and glucanase activity as a reference. T.reesei cell free broth consisted of a mixture of different 
enzymes among which are one or more B-glucanases but where these activities have not been 
characterized. For comparing the activities of Temersonii fi-glucanase of the invention and 
T.reesei cellulases/glucanases, the enzymes were dosed at about equal activity at 40°C. The 
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activity of T.emersonii B-glucanase A at 40°C was set to one and so other activities are relative to 
this. The results are shown below in Table 3. 



Table 3: relative activity wrt T reesei enzymes 



Temperature (°C) 


Relative Activity: 
B-glucanase (CEA) 


Relative Activity: 
T. reesei Cellulases/glucanases 


30 


0.50 


0.44 


40 


1.00 


1.13 


50 


1.57 


1.32 


60 


2.40 


1.59 



At temperatures above 40°C the T. reesei cellulase/glucanase activity started to level off 
when compared to the T.emersonii B-glucanase. Above 40°C the activitys diverged, CEA being 
more active. Also, while relative activity was higher at elevated temperatures, at moderate 
temperatures activity was maintained relative to the T. reesei mixture. This illustrates the wide 
temperature range over which the B-glucanase is active. 

Purification and Specific Activity 

Purification of the T.emersonii B-glucanase started from the filtrate. About 25 ml cell free 
broth was desalted with a PD10 column and placed on a resource Q 6 ml anion exchange column 
which was equilibrated in 10 mM sodium acetate buffer (pH 5.0). Elution was carried out with a 
linear gradient from 10-300 mM sodium acetate buffer (pH 5.0). (Linear gradient A to B; flow 
rate: 6 ml/min ; run time: 54 min; wavelength monitor: 280 nm, 254 nm, 214 nm). Active 
fractions were collected (20 ml), concentrated with Microsep concentrators (3.5 ml, 10 K) and 
washed twice with 0.1 M sodium phosphate buffer (pH 5.0). The purity of the fractions was 
analyzed by HPL-SEC (size exclusion chromatography) and SDS-PAGE. 

Specific activity measurements 

The calculated molar extinction coefficient of the B-glucanase at 280 nm is 81550 
M^.cm* 1 . The protein concentration of the purified enzyme was derived from E280 
measurements using a specific absorption of E280 lcm =2.33 for lmg/ml. The specific activity of 
the purified sample was determined in BGU (substrate: barley beta-glucan) and was 1027 
BGU/mg. Using the CPU method the specific activity was 628 units/mg. As the activity 
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according to the viscosimetric method yielded a high number in comparison with the reducing 
sugar method, it was concluded that the T. emersonii fl-glucanase exhibits significant 
endo-glucanase activity % A typical exo-glucanase would perform very well in a reducing sugar 
assay while performing very badly in the viscosimetric assay. 

Iso-Electric Point 

IEF-PAGE was performed as follows, Equipment was the Phast system (Pharmacia 
Biotech), IEF 3-9 PhastGel (Pharmacia Biotech). Gels were run and stained (Coomassie) 
according to standard Phast system methods. The Iso-Electric Point was determined on PhastGel 
IEF3-9 and turned out to be 3.3. 

Molecular Weight Determination 

SDS-PAGE was performed as follows. Equipment was again the Phast system 
(Pharmacia Biotech); 12.5% homogeneous gels (Pharmacia Biotech); SDS-buffer strips 
(Pharmacia Biotech). Sample treatment: one volume (5 samples) buffer (500mM Tris-HCl, pH 
6.8, 10% SDS, 0.1% Bromo-phenol blue) was mixed with 4 volumes of sample and boiled for. 
3 minutes. Gels were run and stained (Coomassie) according to standard Phast system methods. 
HPLC-size exclusion chromatography was carried out using Column: TSK G3000SW, cat. no. 
05103 (TosoHaas). Method: eluents were: 0.1 M Sodium phosphate buffer pH 7.0, Flow rate: 
lml/min, Run time: 30 min, Wavelength monitor: 280 nm. Deglycosylation of the enzyme: mix 
5(il purified enzyme (7 mg/ml) with 20 \x\ 0.5% SDS and 25 \x\ 1% fl-mercaptoethanol. This 
mixture was boiled for 4 minutes. After cooling down, 20 \xl N-glycosidase F (500 U/ml) and 
20 |il 3?/o Triton X- 100 in 1 M sodium phosphate buffer pH 7.0 were added. This was incubated 
overnight and the deglycosylation analysed with SDS-PAGE. 

The molecular weight determined on SDS-PAGE gel and HP-SEC turned out to be 
43 kDa. After deglycosylation on SDS-PAGE a straight band was seen at 37 kDa. 

Thermostability 

T50 thermostability was determined with the T.reesei cellulases/glucanases as a 
reference. T50 is the temperature at which, after 20 minutes of incubation, 50% of the activity is 
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left. Table 4 shows the differences in T50 thermostability between the purified fi-glucanase from 
T. emersonii and the T.reesei cellulase/glucanases mixture used earlier in this Example (initial 
activity was set to one: the enzymes were incubated for 20 minutes and after cooling activity 
was measured using the CPU method). Each experiment was repeated twice and so Table 4 
shows duplicate measurements. The T50 thermostability of the purified B-glucanase lies around 
93.4°C and the T50 thermostability of T.reesei cellulase / glucanases around 64.9°C. 



Table 4: thermostability wrt T. reesei enzymes 



Temperature 
(°C) 


Residual Activity (%) 
T. reesei cellulases/glucanases 


Residual Activity (%) 
B-glucanase (CEA) 


40 


97 


98 






50 


96 


100 


100 


99 


60 


85 


83 


100 


99 


70 


14 


14 


98 


95 


75 


0 


0 


93 


96 


80 


0 


0 


88 


90 


85 


0 


0 


89 


94 


90 


0 


0 


65 


64 


110 






0 


0 



Example 8: Activity Measurements 

The two library strains which were identified in the viscoscreen in Example 6, using Oat 
Spelt Xylan as a substrate, were fermented in shake flasks using the GM medium (the original 
strains were labelled AD009.2 1 and AD01 1 .3 1). In addition the strain labelled AD02 1 .B6 
(CEA) was included which was identified in Examples 4 and 5 to express B-glucanase activity. 
Several assays were performed and the results are shown below. Surprisingly strains AD009.21 
and ADO 1 1.31 showed little 'xylanase activity. Instead it seems that AD009.21 and ADO 1 1.31 
have cellulase activity even though the clones were identified in the viscoscreening using oat 
spelt xylan. In particular (3 D009.21 is very active on barley P-glucan. 
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Table 5 : Shake flask fermentation of cellulase strains; 
Xylanase (EXU/ml), fi-glucanase (BGU/ml) and cellulase (CXU/ml) activity. 



Strain (and glucanase designation) 


EXU/ml 


BGU/ml 


CXU/m 
1 


AD021.B6 (CEA) 


< 10 


160 


524 


AD009.21 (CEB) 


< 10 


884 


254 


ADO 11.31 (CEC) 


< 10 


< 10 


7 



The detection limits were set to below 10 for the EXU and BGU assays, and 5 for the 
CXU assay. 

The DNA analysis on strains AD 009.21 and AD 01 1 .3 1 (as described earlier in Example 
6) showed two new fl-glucanases, called CEB and CEC. The nucleotide sequences of the coding 
regions are SEQ ID Nos. 3 and 5 and corresponding amino acid sequences are SEQ ID Nos. 4 
and 6, respectively. 

The CEB strain shows high B-glucanase activity. The CEA strain shows a relative high 
cellulase activity in comparison with the CEB strain. The ratio between the B-glucanase activity 
and the cellulase activity is different for CEA and CEB. The CEC strain showed cellulase 
activity but its B-glucanase or xlyanase activity was below limits of detection in these assays. 
However, homology studies strongly suggest B-glucanase activity, and it is thought that this may 
be much higher at different pH's. 

Using more than one substrate for the screening is feasible. Surprisingly it turned out that 
instead of two substrates at a time, three diffent substrates could be used in the viscoscreen. One 
of the substrates was a glucose polymer impurity, likely glucan or cellulose material. It was 
shown that the use of oat spelt xylan, pectin and cellulose like material resulted in the 
identification of three types of enzymes, xylanases, pectinases and cellulases in one round of 
screening. The viscometric assay holds one other great advantage with respect to the common 
dye detection assay. The dye detection assay bases on dyes chemically linked to a substrate. This 
kind of substrate is commercially available for only a few compounds. The viscometric assay, 
however, can work with any type of viscous substrate, even natural ones that do not require any 
chemical pre-treatment. This aspect is of importance as, from a commercial and industrial point 
of view, one can screen for enzyme activities on substrates whose viscosity-decrease (i.e. fruit 
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juices) or viscosity-increase (i.e. dairy products) is of commercial importance. 

Example 9: Baking performance of the Talaromyces emersonii endoglucanase (CEC) 

Preparation of tin bread in a standard baking process was performed by mixing 3500g 
wheat flour (a mix of 80% Kolibri and 20% Ibis wheat flours (Meneba, Holland) at about 21°C), 
77g compressed (Konings) yeast, 70g salt, 25ppm ascorbic acid, lOppm fungal a-amylase 
Fermizyme™P 200 , (DSM N.V., Bakery Ingredients, Delft, The Netherlands) and different . - 
quantities of the endoglucanase CEC enzyme (from 4 different clones, identified by their AD 
number) and 203 OmL water (8-1 5 °C) in a spiral mixer (Hobart) for 2 minutes (at speed 1) and 
for about 6 minutes (at speed 2) to put in 125Wh (Watt-hours) of energy. The dough temperature 
was 28° C. Machineability of the dough was analysed by hand by a qualified baker. 

Directly after mixing the dough was divided into 6 pieces each of 875g, rounded and 
proofed for 35 minutes in a proofing cabinet at 34°C and 85% RH (relative humidity). At the end 
of this period the doughs were shaped and panned and given a final proof of 75 minutes in a 
proofing cabinet at 38°C and 87% RH. Afterwards the fully proofed doughs were baked in an 
electric oven at 210° C for 30 minutes. After cooling to room temperature the volumes of the 
loaves of bread were determined by the rape seed displacement method. After 16-24 hours 
storage in sealed polyethylene bags at room temperature the crumb quality was assessed by a 
qualified baker. Four library strains which were identified to express CEC were fermented in 
shake flasks and tested in this baking test. The results are shown in the Tables below. 



Dosage level of 


Loaf volume 


Dough handling 


Crumb 


CEC AD 01 1.31 








quality 


(CXU/kg flour) 


(mL) I 


:%) 




(scale 0-7) 


0 


4105 


100 


easy, not sticky 


6 


56 


4357 


106 


easier, not sticky 


6 


189 


4289 


104 


easier, some stickiness 


6.5 



Dosage level of 


Loaf volume 


Dough handling 


Crumb 


CEC AD 046b.21 








quality 


(CXU/kg flour) 


(mL) ( 


:%) 




(scale 0-7) 


0 


4105 


100 


easy, not sticky 


6 


56 


4427 


108 


easier, not sticky 


6 


189 


4419 


108 ' . ; 


easier, some stickiness 


6 
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Dosage level of 


Loaf volume 


Dough handling 


Crumb 


CEC AD 050.13 








quality 


(CXLI/kg flour) 


(mL) i 


(%) 




(scale 0-7) 


0 


4105 


100 


easy, not sticky 


6 


56 


4432 


108 


easier, not sticky 


5.5 


189 


4622 


113 


easier, not sticky 


6 



Dosage level of 


Loaf volume 


Dough handling 


Crumb 


CEC AD 078.50 








quality 


(CXU/kg flour) 


(mL) I 


{%) 




(scale 0-7) 


0 


4105 


100 


easy, not sticky 


6 


56 


4450 


108 


easier, not sticky 


5.5 


189 


4553 


111 


easier, some stickiness 


6 



The quality of the doughs was very good. At the higher dosing for endoglucanase CEC 
there was a little stickiness experienced during handling of the dough. However this little 
stickiness did not influence the dough's machineability. All doughs containing endoglucanase 
CEC were very supple and easy to handle. 

From these baking results it was concluded that endoglucanase CEC is very effective in 
improving bread quality, both in terms of loaf volume and in terms of crumb quality. Despite the 
large volumes of the loaves the crumb structure was still very regular and fine. 

Example 10: Comparison of baking performance of the Talaromyces emersonii enzyme (CEC) 
with endoxylanase from Asp, niger 

The baking performance of CEC was compared in Dutch tin bread production with a 
currently used fungal endoglucanase from Aspergillus niger. This A. niger endoglucanase was 
supplied in its pure commercially available form, i.e. Fermizyme™ HS 2000 . 

The exact procedure of Example 9 was repeated except different quantities of either the 
endoglucanase CEC or Fermizyme™ HS 2000 were used. 



Enzyme 


Dosage 
level ' 
(CXU/kg 
flour) 


Loaf 
volume 

(mL) (%) 


Dough handling 


Crumb quality 
(scale 0-7) 


endoglucanase 
CEC 050.13 


0 


4105 


100 


Good, not sticky 


6 ! 


56 


4432' 


108 


Supple, not sticky 


5.5 


189 


4622 


113 


Supple, not sticky 


6 
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Fermizyme 1 '** 
HS 200 o 


Dosage 

level 
(EXU/kg 
flour) 


Loaf 
volume 

(mL) (%) 


Dough handling 


Crumb quality 
(scale 0-7) 




0 


4123 


100 


Good, not sticky 


6 




527 


4361 


106 


Supple, not sticky 


6 




1056 


4535 


110 


Supple, bit sticky 


7 



From the results it is clear that the endoglucanase CEC improved loaf volume to a larger 
extent than obtained by introducing Fermizyme™ HS 2000 . Moreover, less endoglucanase units/kg 
flour were needed to reach a certain level in loaf volume when CEC was used instead of 
Fermizyme™ HS 2 



^2000- 



Example 1 1 

A comparison of some of the molecular and biochemical properties of the three glucases is 
provided in the Tables below. 



Glucanase 


MW 
(after 
deglycosylation) 


Length 
(amino acid 
residues) 


Activity 


Family No.* 


Optimum 
pH 


Pi 


Optim 
Tempei 
re 


CEA 


43kDa 
(37kDa) 


335 


3.2.1.4 

(endoglucanase) 


5 (3D structure: 
ctSBS TIM 
barrel) 


4.8 


3.3 
(4.5 + ) 


85°( 


2. CEB 


44kDa + 


414 


3.2.1.4 

(endoglucanase) 


7 (3D structure; 
B-jelly roll) 




4.2 | 


>75° 


CEC 


23kDa + 


222 


3.2.1.4 

(endoglucanase) 


45 (3D 
structure: 
mixed 6-barrel) 




4.0 


>75° 



+ Predicted from amino acid sequence 



Glucanase/ 
Cellulase 


Family 


Known 
Activities 


Known Activites 


Conclusion 


CEA 


5 


EC 3.2.1.4 
EC3.2.L75 
EC 3.2.1.58 
EC 3.2.1.78 
EC 3.2.1.123 


cellulase 

endo 1 ,6 glucanase 
exo 1 5 3 glucanase 
mannanase 
endoglycoceramidase 


BGU/CXU activity 
Considered unlikely 


CEB 


7 


EC 3.2.1.4 
EC 3.2.1.91 


cellulase 

cellobiohydrolase 


BGU/CXU activity 
considered unlikely 


CEC | 


45 ■ 


EC 3.2.1.4 


cellulase 


BGU/CXU activity but at 
different pH from that tested 
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Glucanase/ 


Family 


3D 


Nucleophile/ 


Mechanism 


Cellulase 






Proton Donor 




CEA 


5 


TIM 8 


glu/glu 


retaining 


CEN 


7 


jelly roll 


glu/glu 


retaining 


CEC 


45 


mixed barrel 


asp/asp 


inverting 




16 


jelly roll 


glu/glu 


retaining 



CEB, CEC are all cellulases that exhibit EC 3.2.1.4 activity. Many cellulases can also 
hydro lyse 1,4 bonds in barley -glucan which makes them particularly useful for certain 
applications such as e.g. eliminating anti-nutritional factors in feed. As activity was measured 
via the viscosity assay it is likely that the cellulase exhibited endo activity. It cannot be excluded 
however that also 1,3 linkages are hydrolysed. Hence, on the observations, EC 3.2.L73, 3.2.1.39 
and 3.2.1.6 activities cannot be ruled out even though these activities are found in family 16: the 
characterises of families 7 and 16 seem rather similar. 

Purel as explanation, cellulases (EC 3.2.1.4) are usually able to catalyse the endo hydrolysis 
of 1,4-D-glucosidic linkage in cellulose. The systematic name of cellulases is 1,4-(2,3;1,4)-D- 
glucan 4-glucanohydrolase. Endo- 1 ,4-D-glucanase and endoglucanase D are synonyms of 
l,4-(l,3;l,4)-D-glucan 4-glucanohydrolase. Cellulase may also hydrolyse 1,4-linkages in -D- 
glucans containing also 1,3 linkages, which makes them particularly useful for certain 
applications such as e.g. eliminating anti-nutritional factors in feed. Such enzymes which exhibit 
endo-glucanase activity are referred to in general as glucanases. In addition hydrolyis of glucan 
can be achived by lichenase (1,3-1 ,4-D-glucan glucanohydrolase (EC 3.2.1.73)) and 
endo-l,3(4)-glucanase (l,3-(l,3;l,4)-D-glucan 3(4) glucanohydrolase (EC 3.2.L6)). 

A review of cellulase activities is shown in the table below. 



EC 

number 


Activity 


Systematic Name 


Recommended 


Synonym 


3.2.1.4 


Endo Hydrolysis of 
1 ,4-P-D-glucosidic 
linkage in cellulose. 

Will also hydrolyse 
1,4-linkages in B-D- 
glucans also containing 
1,3 linkages. 


l,4-(l,3;l,4)-p-D- 
glucan 4- 
glucanohy dro lase 


cellulase 


Endo-l,4-P-D- 
glucanase 
endoglucanase D 
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3.2.1.73 


Hydrolysis 
1,4-P-D-glycosidic 
linlcages in P-D-glucans 
containing 1,3 and 1,4 
bonds. Acts on lichenin 
and cereal 

P-D-glucans, but not on 
P-D-glucans containing 
only 1,3 or only 1,4 
bonds. 


1,3-1 5 4-P-D- 
glucan 

glucanohydrolase 


Lichenase 






Hydrolysis 1,3 
P-D-glucosidic linkages 
in 1,3 P-D-glucans. 
Limited activity on 
mixed 

(1,3-1,4) P-D-glucans 


1 .3-B-D- CT lucan 
glucanohydrolase 


Glucan 

endo-l,3-P- 

glucosidase 


enao i,j p-u- 

glucanase 

P 1,3 glucanase 

Oligo 1,3 glucosidase 


3.2.1.58 


Succesive hydrolysis of 
P-D-glucose units from 
the non-reducing ends of 
1,3-P-D-glucans, 
releasing a-glucose 


1,3-P-D-glucan 
glucohydrolase 


glucan 1,3-0- 
glucosidase 


exo p- 1 ,3 -glucanase 


3.2.1.75 


Random hydrolysis of 
1,6 linkages in 
1,6-P-D-glucosides. 
acts on lutean, pustulan, 
1,6-oligo-P-D-glucosides 


1,6-P-D-glucan 
glucanohydrolase 


glucan endo- 

1,6-p- 

glucosidase 


endo- P- 1 , 6- glucanase 
P-l,6-glucan hydrolase 


3.2.1.6 


Endohydrolysis of 1,3 or 
1,4 linkages in P-D- 
glucans when the glucose 
residue whose reducing 
group is involved in the 
iiujva.gc lo Do nyuroiyseu 
is itself substituted at C- 
3. Substrates include 
cereal 

D-glucans, laminarin, 
lichenin 


l,3-(l,3;l,4)-p-D- 
glucan 3(4) 
glucanohydrolase 


endo-l,3(4)-p- 
glucanase 


p- 1,3 -glucanase 

beta- 1 ,3- 1 ,4-glucanase 

endo-p, 1,3(4)- 
glucanase 


3.2.1.91 


Hydrolysis of 
1,4-P-D-glucosidic 
linkages in cellulose and 
cellotetraose, releasing 
cellobiose from the non- 
reducing end , ij 


1 ,4-D-glucan 
cellobiohydrolase 1 


cellulose 1 4-ft- 
cellobiosidase 


exoglucanase 

cellobiohydrolase 

exo-cellobiohydrolase 

i 
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CLAIMS 

1 . A polypeptide which is a B-glucanase obtainable from a fungus, of the genus 
Talaromyces, such as the fungus Talaromyces emersonii, having the activity EC 3.2.1.4 
(endoglucanase activity). 

2. A polypeptide according to claim 1 which has: 

a. an optimum pH below 5.4; or 

b. an optimum temperature of at least 80°C, such as from 83 to 87°C. 

3. A B-glucanase polypeptide such as according to claim 1 or 2 comprising: 

(i) the amino acid sequence of SEQ ID No. 2, 4 or 6; or 

(ii) a variant of (i) which is capable of cleaving B-D-glucan; or 

(hi) a fragment of (i) or (ii) which is capable of cleaving B-D-glucan. 

4. A polypeptide according to claim 1 wherein the variant (ii) has at least 70%, 75%, 80% 
or 85% identity to the amino acid sequence of SEQ ID No. 2, 4 or 6 and/or the fragment of (iii) is 
at least 150 amino acids in length. 

5. A polypeptide according to claim 1 or 2 which cleaves (l-»3), (l-»4) and/or (l-»6) 
linkages in B-D-glucan. 

6. A polynucleotide comprising: 

(a) the nucleic acid sequence of SEQ ID No. 1, 3 or 5 or a sequence encoding a 
polypeptide according to any preceding claim; 

(b) a sequence which is complementary to, or which hybridises to, a sequence as defined 
in (a); 

(c) a fragment of a sequence in (a) or (b); 

(d) a sequence having at least 60% identity to a sequence as defined in (a), (b) 
or (c); or 

(e) a sequence that is degenerate as a result of the genetic code to any one of the sequences 
as defined in (a) to (d). 

7. A sequence according to claim 7 wherein in (b) the hybridisation is under stringent 
conditions, the fragment in (c) is at least 20 bases in length and/or the identity in (d) is at least 
70% or 80%. 

8. A polynucleotide according to claim 7 which comprises: 
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(a) a sequence that encodes a polypeptide having fl-ghicanase activity, which is: 

(1) the coding sequence of SEQ ID No. 1, 3 or 5; 

(2) a sequence which hybridises selectively to the complement of sequence defined in 
(1); or 

(3) a sequence that is degenerate as a result of the genetic code with respect to a 
sequence defined in (1) or (2); or 

(b) a sequence complementary to a polynucleotide defined in (a). 

9. A polynucleotide according to any of claims 6 to 8 which is a DNA sequence. 

10. A vector comprising a polynucleotide sequence according to any one of claims 6 to 9. 

11. A vector according to claim 10 which is an expression vector, such as where a DNA 
sequence according to claim 9 is operably linked to a regulatory sequence. 

12. A host cell which comprises, as a heterologous sequence, a polynucleotide according 
to any of claims 6 to 9. 

13. A host cell which expresses, as a heterologous protein, a polypeptide according to any 
of claims 1 to 5. 

14. A host cell transformed with the DNA sequence of claim 8 or a vector of claim 10. 

15. A process of producing a polypeptide according to any of claims 1 to 5, the process 
comprising culturing a host cell as defined in any of claims 12 to 14 under conditions that 
provide for expression of the polypeptide. 

1 6. A pharmaceutical, dental, veterinary or detergent composition comprising a 
polypeptide according to any of claims 1 to 5 and a pharmaceutically, dentally, veterinarily or 
detergent acceptable carrier. 

17. The use of a polypeptide according to any of claims 1 to 5 for the manufacture of a 
medicament for treating hyperlipemia and/or high serum cholesterol and triglyceride levels in an 
individual or a disorder resulting therefrom. 

18. A method of treating a subject for hyperlipemia and/or high serum cholesterol and 
triglyceride levels in an individual or a disorder resulting therefrom comprising administering to 
the subject polypeptide according to any of claims 1 to 5. 

19. A method of identifying a compound that modulates B-glucanase activity, the method 
comprising contacting a polypeptide according to any of claims 1 or 5 with a test compound and 
monitoring for B-glucanase activity. : 

20. A composition comprising a polypeptide according to any one of claims 1 to 5. 
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21. A composition according to claim 20 which further comprises a polypeptide having 
cellulase, endo-arabinanase, rhamnogalacturonase or polygalacturonase activity. 

22 . A method of treating plant material., the method comprising contacting the plant 
material with a polypeptide according to any one of claims 1 to 5 or a composition according to 
claim 20 or claim 21. 

23 . A method according to claim 22 wherein the treatment comprises degrading or 
modifying cellulose, such as 13-glucan, in the plant material. 

24. A method according to claim 23 for degrading or modifying plant cell walls. 

25. A method according to any of claims 22 to 24 wherein the treatment comprises 
cleaving of glucose subunits of a B-D-glucan component of the material. 

26. A method according to any of claims 22 to 24 wherein the material comprises a plant, 
plant pulp, plant extract or an edible foodstuff or ingredient therefor, or a fabric, textile or clothes 
containing plant material. 

27. A processed plant material obtainable by contacting a plant material with a polypeptide 
according to any one of claims 1 to 5 or a composition according to claims 20 or claim 20, or 
which results from a method according to any of claim 22 to 25. 

28. A method for reducing the viscosity of a plant material, the method comprising 
contacting the plant material with a polypeptide according to any one of claims 1 to 5 or a 
composition according to claim 20 or claim 21 in an amount and under conditions effective to 
degrade cellulose contained in the material. 

29. Use of a polypeptide according to any one of claims 1 to 5 or a composition according 
to claim 19 or claim 20 in a method of treating plant material. 

30. Use according to claim 28 wherein the treatment comprises cleaving B-D-glucan 
polymers in the plant material. 

31. Use of a polypeptide according to any one of claims 1 to 5 or a composition according 
to claim 20 or claim 21 in a method of processing plant pulp, juice or extract which method 
comprises incubating the pulp, juice or extract with the polypeptide or composition to at least 
partially degrade cellulose. 

32. An (animal) feed comprising a polypeptide according to any one of the claims 1 to 5. 

33. The use of a polypeptide according to any of claims 1 to 5 in brewing, distilling, 
biomethanation, dental hygiene, leather-treatment, paper manufacture, textile treatment or 
manufacture, baking or bread making, washing or detergent treatment, treating flower bulbs or in 
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an animal feed. 

34. A food or foodstuff comprisng a polypeptide according to any of claims 1 to 5. 

35. A food or foodstuff according to claim 32 which is an alcoholic beverage, bread, 
dough or tea. 

36. A transgenic organism comprising a cell according to claim 12 or 13. 

37. An organism according to claim 36 which is a plant or part thereof. 
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SEQUENCE LISTING 



<110> DSM NV 



<120> NOVEL BETA GLUCANASE 



<I30> N78454A 
<140> 

< 1 4 r> 



<150> GB 00302263.9 

<151> 2000-03-20 

<160> 8 

<170> Patentln version 3.0 



<210> 1 . 
<211> 1008 
<212> DNA 

<213> Talaromyces emersonii 

<220> 

<221> CDS 

<222> (1) . . (1008) 

<400> 1 



atg aag ttc age agg gtc gtg tgc -ggt ctg acg gcc gca ggg ggc acc 
Met Lys Phe Ser Arg Val Val' Cys Gly Leu Thr Ala Ala Gly Gly Ala 
1 5 10 



48 



15 



96 



etc gcc get cca gtc aag gag aag ggc ate aag aag egg gcg tct ccg 
Leu Ala Ala Pro Val Lys Glu Lys Gly He Lys Lys Arg Ala Ser Pro 
20 25 30 

It* Sf 3 ^ gg ttC gga tCC aac gag tct ggc Sea gag ttt ggg aac aac 144 
Phe Gin Trp Phe Gly Ser Asn Glu Ser Gly Ala Glu Phe Gly Asn Asn 
35 40 45 

aac ate cct ggc gtg ga$ ggc acc gac tac acc ttc ccc aac acg age 
Asn lie Pro Gly Val Glu Gly Thr Asp Tyr Thr Phe Pro Asn Thr Ser 
50 55 60 

g ^ c ^ C ^f g a ^ c ctc atc 9 ac ca S ggc atg aac ate ttc cgc gtg ccg 
Ala He Gin He Leu He Asp Gin Gly Met Asn He Phe Arg Val Pro 
65 70 ; - ■ 75 " 80 

Phf r Ctg ^ 2 ag ? gc atg gtg ccc aac cag atg acg ggg cc <3 gtg gat 

Phe Leu Met Glu Arg Met Val Pro Asn- Gin Met Thr Gly Pro Val Asp 
85 90 95 



192 



240 



288 
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teg gcg tat ttc cag ggc tac age cag gtt ate aac tac att ace age 336 
Ser Ala Tyr Phe Gin Gly Tyr Ser Gin Val lie Asn Tyr He Thr Ser 
100 105 110 

cat ggc gcg teg gca gtg att gac ccg cat aac ttc ggg cga tac tac 38 4 
His Gly Ala Ser Ala Val He Asp Pro His Asn Phe Gly Arg Tyr Tyr 
115 120 125 

aac aat ate ate tec teg ccg tct gac ttc cag act ttc tgg cac act 432 
Asn Asn He He Ser Ser Pro Ser Asp Phe Gin Thr Phe Trp His Thr 
130 135 140 

att gcg tec aac ttt gcg gat aat gac aat gtc att ttc gac acg aac 480 
He Ala Ser Asn Phe Ala Asp Asn Asp Asn Val He Phe Asp Thr Asn 
145 150 155 ~ 160 

aac gaa tac cac gac atg gac gaa age ctt gtc gtc cag etc aac cag 528 
Asn Glu Tyr His Asp Met Asp Glu Ser Leu Val Val Gin Leu Asn Gin 
165 170 175 

gee gee ate gac ggc ate cgc gee gcg ggc gee aca tea cag tac ate 57 6 
Ala Ala He Asp Gly He Arg Ala Ala Gly Ala Thr Ser Gin Tyr lie 
180 185 190 

ttc gtc gag ggc aac teg tgg ace ggg gee tgg aca tgg acg cag gtc 624 
Phe Val Glu Gly Asn Ser Trp Thr Gly Ala Trp Thr Trp Thr Gin Val 
195 200 205 

aac gac gcg atg gcg aac ctg acg gac ccg cag aac aag ate gtg tac 672 
Asn Asp Ala Met Ala Asn Leu Thr Asp . Pro Gin Asn Lys He Val Tyr 
210 215 220 

gag atg cac cag tac ctg gac teg gac ggg teg ggc acg teg gac cag 720 
Glu Met His Gin Tyr Leu Asp Ser Asp Gly Ser Gly Thr Ser Asp Gin 
225 230 235 240 

tgc gtc aac teg acc ate ggg cag gac' cgc gtc gag teg gcg acg gee 7 68 
Cys Val Asn Ser Thr He Gly Gin Asp Arg Val Glu Ser Ala Thr Ala 
245 250 255 

tgg ctg aag cag aac ggc aag aag gcg ate ctg ggc gag tac get ggc 816 
Trp Leu Lys Gin Asn Gly Lys Lys Ala He Leu Gly Glu Tyr Ala Gly 
260 265 270 . 

ggc gee aac age gtg tgc gag acg gec gtc acc ggc atg etc gac tat 8 64 
Gly Ala Asn Ser Val Cys Glu Thr Ala Val Thr Gly Met Leu Asp Tyr 
275 280 285 

etc gee aac aat act gat gtc tgg acc ggt get ate tgg tgg gcg get 912 
Leu Ala Asn Asn Thr Asp Val Trp Thr Gly Ala He Trp Trp Ala Ala 
290 295 300 

ggg ccg tgg tgg gga gac tat ate ttc tec atg gag ccg cct agt ggg 960 
Gly Pro Trp Trp Gly Asp Tyr He Phe Ser Met Glu Pro Pro Ser Gly 
305 310 - 315 320 

att gcg tat gag cag gtt ctg ccg ttg ctg aag ccg tac etc gaa tga 1008 
lie Ala Tyr Glu Gin Val Leu Pro Leu Leu Lys Pro Tyr Leu Glu 
325 330 335 

<210> 2 
<211> 335 
<212> *PRT 

<213> Talaromyces emersonii 
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<400> 2 

Met Lys Phe Ser Arg Val Val Cys Gly Leu Thr Ala Ala Gly Gly Ala 
1 5 in t 



15 



Leu Ala Ala Pro Val Lys Glu Lys Gly He Lys Lys Arg Ala Ser Pro 
20 25 30 

Phe Gin Trp Phe Gly Ser Asn Glu Ser Gly Ala Glu Phe Gly Asn Asn 
35 40 45 . 

Asn He Pro Gly Val Glu Gly Thr Asp Tyr Thr Phe Pro Asn Thr Ser 
50 55 60 

Ala He Gin He Leu He Asp Gin Gly Met Asn He Phe Arg Val Pro 
65 70 75 "* 80 

Phe Leu Met Glu Arg Met Val Pro Asn Gin Met Thr Gly Pro Val Asp 
85 90 95 

Ser Ala Tyr Phe Gin Gly Tyr Ser Gin Val lie Asn Tyr He Thr Ser 
100 105 HO 

His Gly Ala Ser Ala Val He Asp Pro His Asn Phe Gly Arq Tyr Tvr 
115 120 125 

Asn Asn He He Ser Ser Pro Ser Asp Phe Gin Thr Phe Trp His Thr 
130 135 140 

i i 

lie Ala Ser Asn Phe Ala Asp Asn Asp Asn Val He Phe Asp Thr Asn 
145 I 50 155 160 

Asn Glu Tyr His Asp Met Asp Glu Ser Leu Val Val Gin Leu Asn Gin 
165 170 175 

Ala Ala He Asp Gly lie Arg Ala Ala Gly Ala Thr Ser Gin Tvr He 
180 185 190 

Phe Val Glu Gly Asn Ser Trp Thr Gly Ala Trp Thr Trp Thr Gin Val 
195 200 . ~ 205 

Asn Asp Ala Met Ala Asn Leu Thr Asp Pro Gin Asn Lys He Val Tyr 
210 215 220 

Glu Met His Gin Tyr Leu Asp Ser Asp Gly Ser Gly Thr Ser Asp Gin 
225 230 235 " * 240 

Cys Val Asn Ser Thr lie Gly Gin Asp Arg Val Glu Ser Ala Thr Ala 
245 250 255 

Trp Leu Lys Gin Asn Gly Lys Lys Ala He Leu Gly Glu Tyr Ala Glv 
260 265 270 

Gly Ala Asn Ser Val Cys Glu Thr Ala Val Thr Gly Met Leu Asp Tvr" 
275 280 285 

Leu Ala Asn Asn Thr Asp Val Trp Thr Gly Ala lie Trp Trp Ala Ala 
290 295 300 

Gly Pro Trp Trp Gly Asp Tyr lie Phe Ser Met Glu Pro Pro Ser Glv 
305 310 _ 315 320 

lie Ala Tyr Glu Gin Val Leu Pro Leu "Leu Lys Pro Tyr Leu Glu 
325 330 ' 335 
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<210> 3 

<211> 1245 

<212> DMA 

<213> Talaromyces emefsonii 
<220> 

<221> CDS 

<222> (1) . . (1245) 



<400> 3 

atg gat cga att ctt gcg ctg ate ttg gtc ccc ctt gec act gtc acg 4 8 

Met Asp Arg lie Leu Ala Leu lie Leu Val Pro Leu Ala Thr Val Thr 

1 5 10 ' 15 

gca cag cag att ggc act ate ccc gag gtc cat ccc aag etc ccg aca 96 
Ala Gin Gin lie Gly Thr lie Pro Glu Val His Pro Lys Leu Pro Thr 
20 25 30 

tgg aaa tgc acg ace gag ggc ggc tgt gtc cag cag aat ace tec gtc 14 4 

Trp Lys Cys Thr Thr Glu Gly Gly Cys Val Gin Gin Asn Thr Ser Val 

35 4 0 45;" "■; 

gtg ttg gag tac ctg teg cat ccg ate cat gaa gtt gga aac age gac 192 
Val Leu Glu Tyr Leu Ser His Pro lie His Glu Val Gly Asn Ser Asp 
50 55 60 

gtc teg tgc gtg gtt tct ggc ggg ctg aac cag age etc tgt ccc aac' 240 
Val Ser Cys Val Val Ser Gly Gly Leu Asn Gin Ser Leu Cys Pro Asn 
65 70 75 80 

gaa gag gaa tgt tec aaa aac tgc gtc gtc gag ggg gee aac tac acc 28 8 

Glu Glu 'Glu Cys Ser Lys Asn Cys Val Val Glu Gly Ala Asn Tyr Thr 
85 • 90. 95 

age teg gga gtt cac aca gac ggc gat gee ctg act etc aat cag tac 336 
Ser Ser Gly Val His Thr Asp Gly Asp Ala Leu Thr Leu Asn Gin Tyr 
100 . 105 110 

gtc acg aac ggc gac cag gtc gtc acc gee teg ccg egg gtc tat etc 384 
Val Thr Asn Gly Asp Gin Val Val Thr Ala Ser Pro Arg Val Tyr Leu' 
115 ■ * 120 125 

ctg gee age gac gac gag gac ggg aat tac age atg etc cag etc etc 432 
Leu Ala Ser Asp Asp Glu Asp Gly Asn Tyr Ser Met Leu Gin Leu Leu 
130 135 140 

ggc cag gag ctg age ttt 'gac gtg gac gtc teg aaa ctg gtc tgc ggg 480 
Gly Gin Glu Leu Ser Phe Asp Val Asp Val Ser Lys Leu Val Cys Gly 
145 150 155 160 

atg aac ggc gec ttg tat etc tec gag atg gac gca teg ggc ggc cga . 528 
Met Asn Gly Ala Leu Tyr Leu Ser Glu Met Asp Ala Ser Gly Gly Arg 
165 , - - 170 175 

aac age etc aac ccg gcg ggg gca cag tat ggc tct gga tac tgt gat 57 6 

Asn Ser Leu Asn Pro Ala Gly Ala Gin Tyr Gly Ser Gly Tyr Cys Asp 
180 185 ^ ~ 190 
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gcg caa tgc ggc gtc cag ccc ttc ate aac ggc acg gtc aac acc acre 
Ala Gin Cys Gly Val Gin Pro Phe lie Asn Gly Thr Val Asn Thr Gly 



205 



teg etc ggc get tgc tgc aac gag atg gac ate tgg gaa gcg aat acc 
Ser Leu Gly Ala Cys Cys Asn Glu Met Asp lie Trp Glu Ala Asn Ala 
210 215 



220 



? fct «u C g ? g ttg act CCg cac ccg tgc a 9 c gtc acc age ate tat 

Leu Ala Thr Ala Leu Thr Pro His Pro Cys Ser Val Thr Ser He Tyr 
225 230 235 



240 



gec tgt tct ggc get gag tgc ggc tec aac ggc gtc tgc gac aag ccg 
Ala Cys Ser Gly Ala Glu Cys Gly Ser Asn Gly Val Cys Lp Lys Pro 
2 45 250 - 



255 



400 



aac cca gcg act att gaa gcg cag cat cct gac acc gcg gtg acc ttc 

Asn Pro Ala Thr He Glu Ala Gin His Pro Asp Thr Ala Val Thr Phe 

385 390 395 

teg aac ate aga tgg ggg gat ate ggg teg acg ttc cag teg taa 

Ser Asn He Arg Trp Gly Asp He Gly Ser Thr Phe Gin Ser 

405 ' 410 



<210> 4 

<211> 414 

<212> PRT 

<213> Talaromyces emersonii 



62 4 



672 



720 



768 



gga tgc gga tac aac ccg tac gcg ctg gga gac cac aac tac tac gga 
Gly Cys Gly Tyr Asn Pro Tyr Ala Leu Gly Asp His Asn Tyr Tyr Glv 
260 265 270 

ccc ggg aag acg gtc gac acg tec agg ccc ttc acc gtg gta acg cag 
Pro Gly Lys Thr Val Asp Thr Ser Arg Pro Phe Thr Val Val Thr Gin 
275 280 285 

r CtC aaC gaC aa ° aCC acg acg 399 act ctg acg gag ate cgt 

9 90 2 95 GiU Ile g 

cgt ctg tac gtc caa gac ggc aac gtg ate ggg cct teg cct age gac 
Arg Leu Tyr Val Gin Asp Gly Asn Val He Gly Pro Ser Pro Ser Lp 
305 310 315 32 £ 

tct gtc teg tea ate acg gac teg ttc tgt tec acg gtg gat tec tat 
Ser Val Ser Ser Ile Thr Asp Ser Phe Cys Ser Thr Val Asp Ser Tvr 
325 330 335 

ttc gag ccg ctt ggc ggc ctg aag gag atg ggc gag gcg ctg ggt egg 
Phe Glu Pro Leu Gly Gly Leu Lys Glu Met Gly Glu Ala Leu Glv Arc 
340 345 350 

ggg atg- gtg ctg gtg ttc age ate tgg aat gat cct ggt cag ttc ata 
Gly Met Val Leu Val Phe Ser Ile Trp Asn Asp Pro Gly Gin Phe Met 
355 • 360 365 

aac tgg etc gac age ggg aat get ggg ccc tgc aac age acc gag ggg lis? 
Asn Trp Leu Asp Ser Gly Asn Ala Gly Pro Cys Asn Ser Thr Glu Glv 
370 375 380 



816 



8 64 



912 



960 



1008 



1056 



1104 



1200 



1245 
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<400> 4 

Met Asp Arg lie Leu Ala Leu lie Leu Val Pro Leu Ala Thr Val Thr 
15 10 15 

Ala Gin Gin lie Gly Thr lie Pro Glu Val His Pro Lys Leu Pro Thr 

20 25 ' 30 

Trp Lys Cys Thr Thr" Glu Gly Gly Cys Val Gin Gin Asn Thr Ser Val 
35 40 ■ 45 

Val Leu Glu Tyr Leu Ser His Pro lie His Glu Val Gly Asn Ser Asp 
50 55 60 

Val Ser Cys Val Val Ser Gly Gly Leu Asn Gin Ser Leu Cys Pro Asn 
65 70 75 80 

Glu Glu Glu Cys Ser Lys Asn Cys Val Val Glu Gly Ala Asn Tyr Thr 
85 - 90. " 95 

Ser Ser Gly Val His Thr Asp Gly Asp Ala Leu Thr Leu Asn Gin Tyr 
100 105 110 

Val Thr Asn Gly Asp Gin Val Val Thr Ala Ser Pro Arg Val Tyr Leu 
115 120 125 

Leu Ala Ser Asp Asp Glu Asp Gly Asn Tyr Ser Met Leu Gin Leu Leu 
130 135 140 

Gly Gin Glu Leu Ser Phe Asp Val Asp Val Ser Lys Leu Val Cys Gly 
145 150 • * 155 "* 160 

Met Asn Gly Ala Leu Tyr Leu Ser Glu Met Asp Ala Ser Gly Gly Arg 
165 170 175 

Asn Ser Leu Asn Pro Ala Gly Ala Gin Tyr Gly Ser Gly Tyr Cys Asp 
180 185 "* . 190 

Ala Gin Cys Gly Val Gin Pro Phe lie Asn Gly Thr Val Asn Thr Gly 
195 200 205 

Ser Leu Gly Ala Cys Cys Asn Glu Met Asp lie Trp Glu Ala Asn Ala 
210 215 220 

Leu Ala Thr Ala Leu Thr Pro His Pro Cys Ser Val Thr Ser lie Tyr 
225 230 235 240 

Ala Cys Ser Gly Ala Glu Cys Gly Ser Asn Gly Val Cys Asp Lys Pro 
245 250 "* 255 

Gly Cys Gly Tyr Asn Pro Tyr Ala Leu Gly Asp His Asn Tyr Tyr Gly 
260 265 270 

Pro Gly Lys Thr Val Asp, Thr Ser Arg Pro Phe Thr Val Val Thr Gin 
275 280 285 

Phe Leu Thr Asn Asp Asn Thr Thr Thr Gly Thr Leu Thr Glu lie Arg 
290 295 300 

Arg Leu Tyr Val Gin Asp Gly Asn_Val lie Gly Pro Ser Pro Ser Asp 
305 310 . / j 315 320 

Ser Val Ser Ser lie Thr Asp Ser Phe Cys Ser Thr Val Asp Ser Tyr 
325 330 335 
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Phe Glu Pro Leu Gly Gly Leu Lys Glu Met Gly Glu Ala Leu-Gly Ara 
340 345 350' g ' 

Gly Met Val Leu -Val Phe Ser lie Trp Asn A S p Pro Gly Gin Phe Met 
•5 3 3 360 365 

Asn Trp Leu Asp Ser Gly Asn Ala Gly Pro Cys Asn Ser Thr Glu Gly 
J/u 375 330 J 



Asn Pro Ala Thr He Glu Ala Gin His Pro Asp Thr Ala Val Thr Phe 

390 ;j395 400 

Ser Asn He Arg Trp Gly Asp He Gly Ser Thr Phe Gin Ser 



<210> 5 

<211> 669 

<212> DNA 

<213> Talaromyces emersonii 
<220> 

<221> CDS 

<222> (1)..(669) 

<400> 5 



atg aat gtc aga get gtt gtc tct gtt tct gec ttc ttg ctq aca act 
Met Asn Val Arg Ala Val Val Ser Val Ser Ala Phe Le! Leu Thr Pro 
5 10 15 

ttg get tea gec ctg aca gga acc acc aca aca aca tgg gac tat tat 
Leu Ala Ser Ala Leu Thr Gly Thr Thr Thr Thr Thr Trp £p Cys lys 



55 . 60 

?f C £ Ct ? cc a< 3° « cc t9 c ca g gga ggc gat gec tac age tac tec 

Ala Ser Ala Pro Ser Ala Cys Gin Gly Gly Isp Ila Tyr sir cjs Ser 

70 j 75 J 80 

gac ttc cag ccc ate ate ate agt gac aca tta tea tac arr* n-- 
Asp Phe Gin Pro lie lie lie Ser Isp Th? Se? xyr l?y Phe K 

85 90 95 

ggc aac tgg gag aca age aac tgc tgc aag tqc ttc caa ttr anr- 
Gly Asn Trp Glu Thr Ser Asn C?s Cys Lys C?s Pne til Pne Sr Tr? 

100 . '105 no p 

acg teg ggg gC g ggt gcg ggc aag tec atg ate gtc caa gtt ate aat 
Thr Ser Gly Ala Gly Ala Gly Lys Ser Met He Val Gin Val Val Asn 



48 



96 



144 



aaa cca gcg tgt age tgg acg caa aat gec caa gca ggc gaa aca aat 
Lys Pro Ala Cys Ser Trp Thr Gin Asn Ala Gin lla G?y Gly HI £ . 

40 45 

HI 3£ tl ffi £ £ JS «= »= S* SI £; ffi £ l S x» 



240 



283 



336 



384 
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115 120 125 

tct ggc ggg gtc agt aca ggc gat ttc gac att tac acg cct ggt ggc 4 32 

Ser Gly Gly Val Ser Thr Gly Asp Phe Asp lie Tyr Thr Pro Gly Gly 
130 . 135 140 

ggt gtt gga gat tac aat gcc tgc act teg cag tat ggc gcg cca cca 4 80 

Gly Val Gly Asp Tyr Asn Ala Cys Thr Ser Gin Tyr Gly Ala Pro Pro 
145 ^ " 150 " . 155 "* 160 

cag gga tgg ggt get caa tac ggc ggc gtc tec age gac get gaa tgc 528 
Gin Gly Trp Gly Ala Gin Tyr Gly Gly Val Ser Ser Asp Ala Glu Cys 
165 170 175 

gac caa etc ccc teg ate ctg cag cct gga tgc eac tgg cgt ttc gag 57 6 

Asp Gin Leu Pro Ser lie Leu Gin Pro Gly Cys His Trp Arg Phe Glu 
180 185 190 

tgg gca gga ggt ggc ate aac gga tgg acg act gag tac gag gaa gtc 624 
Trp Ala Gly Gly Gly lie Asn Gly Trp Thr Thr Glu Tyr Glu Glu Val 
195 ~ 200 205 

gat tgc cca age cag ctt act tec -ate tea ggt tgc tat ccg tga 669 
Asp Cys Pro Ser Gin Leu Thr Ser lie Ser Gly Cys Tyr Pro 
210 215 220 

<210> 6 

<211> 222 

<212> PRT 

<213> Talaromyces emersonii 



<400> 6 

Met Asn Val .Arg Ala Val Val Ser Val Ser Ala Phe Leu Leu Thr Pro 
15 10 15 

Leu Ala Ser Ala Leu Thr Gly Thr Thr Thr Thr Thr Trp Asp Cys Cys 
20 "25 30 

Lys Pro Ala Cys Ser Trp Thr Gin Asn Ala Gin Ala Gly Gly Ala Ser 

35 40 !; 45 

Gly Thr Val Ala Thr Cys Asn lie Asn Asn Gin Val Leu Ser Asn Gly 
50 55 60 

Ala Ser Ala Pro Ser Ala Cys Gin Gly Gly Asp Ala Tyr Ser Cys Ser 
65 70 "* 75 80 

Asp Phe Gin Pro He He lie Ser Asp Thr Leu Ser Tyr Gly Phe Ala 

85 t 90 / ' ~ 95 

Gly Asn Trp Glu Thr Ser Asn Cys Cys Lys Cys Phe Gin Phe Thr Trp. 
100 105 110 

Thr Ser Gly Ala Gly Ala Gly Lys Ser Met He Val Gin Val Val Asn 
115 120- - - . ■ 125 

Ser Gly Gly Val Ser Thr Gly Asp Phe Asp lie Tyr Thr Pro Gly Gly 
130 135 140 
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Gly Val Gly Asp Tyr Asn Ala Cys 
145 . 150 

Gin Gly Trp Gly Ala Gin Tyr Gly 
•165 

Asp Gin Leu Pro Ser lie Leu Gin 
180 

Trp Ala Gly Gly Gly lie Asn Gly 

195 200 

Asp Cys Pro Ser Gin Leu Thr Ser 
210 215 



Thr Ser Gin Tyr Gly Ala Pro Pro 
155 160 

Gly Val Ser Ser Asp Ala Glu Cys 
170 175 

Pro Gly Cys His Trp Arg Phe Glu 
185 190 

Trp Thr Thr Glu Tyr Glu Glu Val 
205 

lie Ser Gly Cys Tyr Pro 
220 



<210> 7 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 
<400> 7 

tatagcgaaa tggattgatt gtacgctc 



<210> 8 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer ■ 
<400> 8 

atccccagca tcattacacc tcagtg 
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